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DNA ENCODING ACYLCOENZYME A: CHOLESTEROL 
ACYLTRANSFERASE AND USES THEREOF 

5 

This application is a Continuation-In-Part of U.S. Serial 
No. 08/657,620, filed May 30, 1996, the content of which 
is incorporated by reference into this application. 

10 Throughout this application, various publications are 

referenced by Arabic numerals. Full citations for these 
publications may be found listed at the end of the 
specification. The disclosures of these publications in 
their entireties are hereby incorporated by reference 

15 into this application in order to more fully describe the 

state of the art as known to those skilled therein. 

Background of the Invention 

Cholesterol or related sterols, required for the 

20 viability of eukaryotic cells, exist in the free form or 

as esters conjugated to fatty acids. The concentration 
of free sterol determines the fluidity of eukaryotic cell 
membranes, whereas esterified sterols cannot participate 
in membrane assembly. The esterif ication of 

25 intracellular sterol, mediated in mammals by the 

membrane-bound enzyme, acylcoenzyme A: cholesterol 
acyltransferase, is thus a critical homeostatic 
determinant of membrane function (1, 2). For example, 
cholesterol depletion of the rough endoplasmic reticulum 

30 (ER) relative to the smooth ER (3), may modulate protein 

translocation or membrane-associated transcriptional 
activators such as the Sterol Response Element Binding 
proteins (SREBP, 4) . In addition, production of 

cholesterol ester (CE) by acylcoenzyme A: cholesterol 

35 acyltransferase in the rough ER may influence the 

transport of sterol between intracellular pools. Similar 
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esterif ication activities have been observed in other 
eukaryotes such as plants and yeasts (5) . 

Elevations in acylcoenzyme A: cholesterol acyltransf erase 
5 activity perturb several pathways that contribute to 

hyperlipidemia and atherosclerosis. Sterol 
esterif ication modifies the activity of the low density 
lipoprotein (LDL) receptor and alters serum lipoprotein 
composition to be pro-atherogenic (6, 7) . it may also be 

10 a rate limiting step in intestinal sterol absorption (8). 

Furthermore, CE deposition in the arterial wall is an 
important initial step in atherogenesis (9). The 
understanding of the acylcoenzyme A: cholesterol 
acyltransf erase reaction has been hampered by the 

15 difficulty of biochemical purification and by a poor 

grasp of the relevant genetic determinants. A human 
acylcoenzyme A: cholesterol acyltransf erase I gene from 
macrophages was identified by complementation of Chinese 
Hamster Ovary cell lines deficient in acylcoenzyme A: 

20 cholesterol acyltransf erase activity (10) and was 

functionally expressed in insect cells devoid of 
endogenous activity (11) . 



25 
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Summary of the Inventign 

This invention provides an isolated nucleic acid which 
encodes an acylcoenzyme A: cholesterol acyltransf erase II 
5 or an acylcoenzyme A: cholesterol acyltransf erase III. 

This invention also provides a vector which includes the 
isolated nucleic acid which encodes an acylcoenzyme A: 
cholesterol acyltransf erase II or an acylcoenzyme A: 
10 cholesterol acyltransf erase III and a host vector system 

which includes a vector. 

This invention also provides a method of producing a 
polypeptide which comprises growing such host vector 

15 system of claim 14 under suitable conditions permitting 

production of the polypeptide and recovering the 
polypeptide so produced. This invention also provides a 
purified wildtype acylcoenzyme A: cholesterol 
acyltransf erase II or an acylcoenzyme A: cholesterol 

20 acyltransf erase III. 

This invention also provides an oligonucleotide of at 
least 15 nucleotides capable of specifically hybridizing 
with a unique sequence of nucleotides present within a 

25 nucleic acid which encodes a wildtype acylcoenzyme A: 

cholesterol acyltransf erase II or an acylcoenzyme A: 
cholesterol acyltransf erase III without hybridizing to a 
nucleic acid which encodes a mutant acylcoenzyme A: 
cholesterol acyltransf erase II or an acylcoenzyme A: 

30 cholesterol acyltransf erase III. This invention also 

provides an oligonucleotide of at least 15 nucleotides 
capable of specifically hybridizing with a unique 
sequence of nucleotides present within the nucleic acid 
which encodes a mutant acylcoenzyme A: cholesterol 

35 acyltransf erase II or an acylcoenzyme A: cholesterol 
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acyltransf erase III without hybridizing to a nucleic acid 
which encodes a wildtype acylcoenzyme A: cholesterol 
acyltransf erase II or an acylcoenzyme A: cholesterol 
acyltransf erase III. 

5 

This invention also provides a method for determining 
whether a subject known to have an imbalance in sterol 
levels has the imbalance due to a defect in 
esterif ication of sterol and for treating a subject who 
10 has an imbalance in sterol levels due to a defect in 

esterif ication of sterol. 



This invention also provides methods for inhibiting 
wildtype acylcoenzyme A: cholesterol acyltransf erase II 
15 or an acylcoenzyme A: cholesterol acyltransf erase III in 

a subject. 

This invention also provides a method for identifying a 
chemical compound which is capable of inhibiting 
20 acylcoenzyme A: cholesterol acyltransf erase II or an 

acylcoenzyme A: cholesterol acyltransf erase III in a 
subject and a pharmaceutical composition comprising of 
the chemical compound so identified, 

25 This invention also provides a transgenic, nonhuman 

mammal comprising the isolated nucleic acid which encodes 
acylcoenzyme A: cholesterol acyltransf erase II or an 
acylcoenzyme A: cholesterol acyltransf erase III. 



30 
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Brief Description of the Figures 

Abbreviations: The amino acid residues are abbreviated 
as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, 
Gly; H, His; I, lie; K, Lys; L, Leu; M, Met; N, Asn; P, 
Pro; Q, Gin; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and 
Y, Tyr. CON: consensus sequence. 

Figures 1A and IB. Protein sequence alignments predicted 
from candidate genes for the human acyl coenzyme A: 
cholesterol acyl transferase gene I, the yeast homologs, 
acylcoenzyme A: cholesterol acyltransferase-related 
enzyme I and acylcoenzyme A: cholesterol acyltransferase- 
related enzyme 11, and a consensus sequence of all three 
1 5 sequences . 

Identical residues between all the sequences 
are in bold face. Residues of the candidate 
leucine zipper heptad motif are italicized. 
Potential transmembrane domains were identified 

20 at residues 132 to 155 and 460 to 483; 186 to 

202 and 406 to 421; and 215 to 231 and 439 to 
451, for human acylcoenzyme A: cholesterol 
acyltransf erase (Sequence I.D. No.: 2), 
acylcoenzyme A: cholesterol acyltransf erase- 

25 related enzyme I (Sequence I.D. No.: 4) and 

acylcoenzyme A: cholesterol acyltransferase- 
related enzyme II (Sequence I.D. No.: 6), 
respectively. The firefly luciferase signature 
sequences identified in human acylcoenzyme A: 

30 cholesterol acyltransf erase I (10) were not 

conserved in the yeast genes. CON (Sequence 
I.D. No.: 13) denotes the consensus sequence 
between the sequences of human acylcoenzyme A: 
cholesterol acyltransf erase, acylcoenzyme A: 
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cholesterol acyltransf erase-related enzyme I 
and acy 1 coenz yme A: cholesterol 

acyltransf erase-related enzyme II. R07932 
denotes the partial sequence of another human 
5 acylcoenzyme A: cholesterol acyltransf erase 

candidate cDNA (residues 500 to 600) (Sequence 
I.D. No.: 14). The asterisks indicate the 
residues in R07932 identical to those of the 
other sequences. 

10 1A. Alignment of amino acid residues 1-362 of 

acylcoenzyme A: cholesterol acyltransf erase- 
related enzyme I and the identical residues in 
acylcoenzyme A: cholesterol acyltransf erase- 
related enzyme II, human acylcoenzyme A: 

15 cholesterol acyltransf erase and CON. 

IB. Alignment of amino acid residues 363-611 of 

acylcoenzyme A: cholesterol acyltransf erase- 
related enzyme I and the identical residues in 
acylcoenzyme A: cholesterol acyltransf erase- 

20 related enzyme II, human acylcoenzyme A: 

cholesterol acyltransf erase and CON. 

Figures 2A, 2B, 2C, 2D and 2£. Construction and analysis 
of acylcoenzyme A: cholesterol acyltransf erase genes and 
25 deletion mutants. 

2A. The arelANA deletion. The schematic depicts a 

fragment from yeast chromosome III in plasmid 
pH3<34). Strategic restriction endonucleases 
are indicated (H, Hind III; B, Bam HI) . 
30 2B. The autoradiogram depicts Bam HI digested DNA 

from wild-type or disrupted diploid strains 
probed with the 2993-bp Bam-HI fragment. This 
produced a fragment corresponding to the 
wild-type acylcoenzyme A: cholesterol 
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acyltransf erase-related enzyme I locus and a 
1984-bp fragment characterizing the arelA NA 
allele. The diploid is heterozygous for the 
acylcoenzyme A: cholesterol acyltransf erase- 
related enzyme I deletion. 

Reduced stringency hybridization of yeast 
genomic DNA with acylcoenzyme A: cholesterol 
acyltransferase-related enzyme I coding 
sequences. Genomic DNA from wild-type or 
KREl/arelANA diploids were reprobed with an Nhe 
I-Avr II fragment corresponding to the 
acylcoenzyme A: cholesterol acyltransferase- 
related enzyme I open reading frame ("ORF") . 
Hybridizations and washes were performed at 
60 °c in the absence of formamide. 
The are2A deletion. In step 1, PCR amplifying 
oligonucleotides, KO-5' and KO-3' and a LEU2 
template were used to produce the selectable 
yeast gene flanked at the 5' and 3' ends by 
acylcoenzyme A: cholesterol acyltransferase- 
related enzyme II. In step 2, this was used to 
direct homologous recombination at acylcoenzyme 
A: cholesterol acyltransferase-related enzyme 
II by transformation of a diploid strain and 
selection for leucine protrophy. In step 3, 
integrants to acylcoenzyme A: cholesterol 
acyltransferase-related enzyme II were 
identified by a PCR reaction using 
oligonucleotides flanking ARE2 (are2-5' and 
are2-3') and a 3' amplimer within LEU2 (L2-3'). 



2E, 



A 999-bp fragment identifies are2A, as shown in 
the ethidium bromide stained agarose gel. The 
wild-type fragment (2206-bp) is also produced 
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in the same reaction. Leucine prototrophic 
trans formants with deletions of acylcoenzyme A: 
cholesterol acyltransf erase-related enzyme II 
were obtained at a frequency of -2%. M 
5 indicates the 50-2,000-bp ladder markers 

(Bio-Rad Laboratories) . 



Figures 3A and 3B. Fluorescent staining of triglyceride 
and sterol ester. 

10 The cells were grown in YEPD to stationary 

phase, washed with deionized H ? 0, and incubated 
with 1 //g/ml Nile Red (1 mg/ml in acetone) . 
Fluorescent images were obtained with a BioRad 
MRC600 laser scanning confocal microscope 

15 (BioRad Microscience, Hercules, CA) on an 

inverted Zeiss Atiovert microscope (Zeiss, 
OberKochem, Germany) using 63X (NA1.4) Zeiss 
Plan-apo infinity corrected objective. Samples 
were illuminated with the 488nm line from an 

20 argon ion laser and the fluorescence was 

visualized with a 540nm dichroic mirror and 
550nm long-pass emission filter. Staining of 
the cytoplasmic lipid droplets was sensitive to 
treatment with isopropanol, proving them to be 

25 lipid in nature. 

3A. Wild-type cells. 

3B. arelA NAare2A double mutant cells. 



Figures 4A, 4B, 4C and 4D. Neutral lipid and sterol 
30 biosynthesis in ARE deletion mutants. 

Strain genotypes are as described in the text; 
dpm/mg dry weight: disintegrations per minute 
per milligram of dry weight of cells. 
4A. Triglyceride biosynthesis. Total lipids were 
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extracted from cells grown in media containing 
3 H-oleate and analyzed by thin-layer 
chromatography . 

Sterol ester biosynthesis. Total lipids were 
extracted from cells grown in media containing 
3 H-oleate and analyzed by thin-layer 
chromatography . 

Sterol ester biosynthesis in wild-type and 
mutant cells transformed with vector control 
(black box) or acylcoenzyme A: cholesterol 
acyltransf erase-related enzyme I 

over-expression plasmids,, YEp3-16 (increased 
copy number, shaded box) and pADH5-36 
(transcription from the ADH promoter, open 
15 boxes) . Cells were grown in selective media to 

maintain the acylcoenzyme A: cholesterol 
acyltransf erase-related enzyme I expression 
plasmids. Lipids were labeled, extracted and 
analyzed as above. 
20 4D. Sterol biosynthesis in acylcoenzyme A: 

cholesterol acyltransf erase-related enzyme 
deletion mutants. Lipids were labeled in 
synthetic complete media containing [1- 14 C] 
acetate, saponified and extracted with hexane 
25 an <* subjected to thin layer chromatography 

analysis. The data is representative of three 
separate experiments and expressed as the ratio 
of incorporation into sterols to incorporation 
into fatty acids. 

30 

Figures 5A, SB, 5C, 5D, 5E and 5F. The nucleic acid and 
amino acid or predicted amino acid sequences. 
5A-1 - 5A-3. 

The nucleic acid sequence of human acylcoenzyme 
35 A: cholesterol acyltransf erase I designated 
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Sequence ID No.: 1. The amino acid sequence of 
human acylcoenzyme A: cholesterol 

acyltransf erase I designated Sequence ID No. : 
2 . 

5 5A-1. Nucleic acid sequence of human 

acylcoenzyme A: cholesterol 

acyltransf erase I from nucleic acid 
bases 1-1624. Amino acid sequence of 
human acylcoenzyme A: cholesterol 
10 acyltransf erase I from amino acid 

residues 1-76. 
5A-2. Nucleic acid sequence of human 

acylcoenzyme A: cholesterol 

acyltransf erase I from nucleic acid 
15 bases 1625-2524. Amino acid sequence 

of human acylcoenzyme A: cholesterol 
acyltransf erase I from amino acid 
residues 77-376. 
5A-3. Nucleic acid sequence of human 

20 acylcoenzyme A: cholesterol 

acyltransf erase I from nucleic acid 
bases 2525-3649. Amino acid sequence 
of human acylcoenzyme A: cholesterol 
acyltransf erase I from amino acid 
25 residues 377-551. 

5B-1 - 5B-3. 

The nucleic acid sequence of yeast acylcoenzyme 
A: cholesterol acyltransf erase-related enzyme 
I designated Sequence ID No.: 3. The amino 
30 acid sequence of yeast acylcoenzyme A: 

cholesterol acyltransf erase-related enzyme I 
designated Sequence ID No.: 4. 

5B-1. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransf erase- 
35 related enzyme I from nucleic acid 
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bases 1-1289. Amino acid sequence of 
acylcoenzyme A: cholesterol 

acyltransferase-related enzyme I from 
amino acid residues 1-209. 

5B-2. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransferase- 
related enzyme I from nucleic acid 
bases 1290-2114. Amino acid sequence 
of acylcoenzyme A: cholesterol 
acyltransferase-related enzyme I from 
amino acid residues 210-484. 

5B-3. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransferase- 
related enzyme I from nucleic acid 
bases 2115-2601. Amino acid sequence 
of acylcoenzyme A: cholesterol 
acyltransferase-related enzyme 1 from 
amino acid residues 485-611. 

-3. 

The nucleic acid sequence of yeast acylcoenzyme 
A: cholesterol acyltransferase-related enzyme 
II designated Sequence ID No.: 5. The amino 
acid sequence of yeast acylcoenzyme A: 
cholesterol acyltransferase-related enzyme II 
designated Sequence ID No.: 6. 

5C-1. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransferase- 
related enzyme II from nucleic acid 
bases 1-1061. Amino acid sequence of 
acylcoenzyme A: cholesterol 

acyltransferase-related enzyme II 
from amino acid residues 1-238. 

5C-2. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransferase- 
related enzyme II from nucleic acid 
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bases 1062-1961. Amino acid sequence 
of acylcoenzyme A: cholesterol 
acyltransf erase-related enzyme II 
from amino acid residues 239-538. 
5 5C-3. Nucleic acid sequence of acylcoenzyme 

A: cholesterol acyltransf erase- 
related enzyme II from nucleic acid 
bases 1962-2421. Amino acid sequence 
of acylcoenzyme A: cholesterol 
10 acyltransf erase-related enzyme II 

from amino acid residues 539-643. 

5D. The nucleic acid sequence of mouse acylcoenzyme 

A: cholesterol acyltransf erase II designated 
15 Sequence ID No.: 11. The amino acid sequence 

of mouse acylcoenzyme A: cholesterol 
acyltransf erase II designated Sequence ID No.: 
12. 

2 0 Figure 6A. A restriction map of the expression vector 

YepAB-ACAT2 . 

Figure 6B and 6C . Expression of human macrophage ACAT 
in pRS426GP. 

25 6B. The ACAT open reading frame was inserted 

at the AfotI and SacI sites, downstream of 
the promoter of the GAL1/10 gene 
(GALl/lOp) as described in the text to 
produce pRS426-ACAT. URA3 and Amp r denote 

30 selectable markers for yeast and E. coli 

respectively. The yeast and bacterial 
origins of replication (2/im and ori, 
respectively ) are indicated. 
6C . Immunoblot of human ACAT in protein 
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extracts from cells transformed with 
PRS426-ACAT. Double mutant cells (arel' 
are2~) , transformed with pRS426-ACAT 
(hACAT) or with pRS426GP (vector), were 
5 induced by growth in galactose. Proteins 

were analyzed by immunoblotting . 
Equivalent amounts of protein extracts 
from mouse adrenal cells were loaded for 
comparison. Molecular weight reference 

10 markers (BioRad) are indicated (M) . The 

arrow indicates the position of the DM10 
immunoreactive product in extracts from 
murine adrenals. The expressed form of 
hACAT in yeast is of coincident mobility. 

15 Figures 7A and 7B. Multiple human tissue Northern 

analysis of poly (A) + RNAs probed with 32 P-labeled cDNA 
CI. 

7A - Tissue specific expression of wildtype human 

acylcoenzyme A: cholesterol acyltransf erase II 
20 using a wildtype acylcoenzyme A: cholesterol 

acyltransf erase II specific probe. 
7B - Tissue specific expression of wildtype human 

acylcoenzyme A: cholesterol acyltransf erase I 
using a wildtype acylcoenzyme A: cholesterol 
25 acyltransf erase I specific probe. 

Figure 8A, 8B, 8C and 8D. Tissue specific expression of 
ARGP1 and hACAT . 

8A and 8B. Multiple tissue Northerns (Clontech) with 

indicated samples were probed with an 
ARGP1 specific probe as described in the 
text . 

8C and 8D. The same blots were also analyzed using a 

hACAT specific probe. The first panel is 



30 
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identical to that published by Chang et al 
(8) . The second panel is the same blot as 
in A and B, probed with the ACAT cDNA 1600 
bp probe . 

Figure 9. Fetal Tissue specific expression of AGRP2. 

Multiple tissue Northerns of fetal tissue (Clontech) 
with indicated samples, were probed with and AGRP2 
specific probe as described in the text. 



Figure 10. Cultured cell expression of AGRP1 . 

RNA samples from HepG2 and CV1 were reverse 
transcribed and PCR amplified as described in the 
text. P indicate a plasmid template control. The 
15 blank lanes represent water or no RT controls. 

Figure 11. Sequence comparison of human ACAT and AGKP1 
Figure 12. Sequence comparison of human ACAT and AGRP2 

20 

Figure 13. Phylogenetic Comparisons of ACAT like 
molecules . 

The sequences shown were identified in genome 
databases and aligned based on protein sequence 
25 using GCG Inc software (pileup) . They were 

subsequently arranged to their sequence conservation 
to determine approximate evolutionary relatedness. 

Figure 14 . Conserved motifs in ACAT relate gene 
30 products. 

Figure 15A and 15B. Nucleotide and predicted protein 
sequence of ARGP1 . 
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Figure 16. Nucleotide and predicted protein sequence of 
ARGP2. 
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Detailed Description of the Invention 

Throughout this application, references to specific 
nucleotides are to nucleotides present on the coding 
5 strand of the nucleic acid. The following standard 

abbreviations are used throughout the specification to 
indicate specific nucleotides: 

C=cytosine A=adenosine 
10 T=thymidine G=guanosine 

A "gene" means a nucleic acid molecule, the sequence of 
which includes all the information required for the 
normal regulated production of a particular protein, 
15 including the structural coding sequence, promoters and 

enhancers . 

The nucleic acids or oligonucleotides of the subject 
invention also include nucleic acids or oligonucleotides 

20 coding for polypeptide analogs, fragments or derivatives 

which differ from naturally-occurring forms in terms of 
the identity or location of one or more amino acid 
residues (deletion analogs containing less than all of 
the residues specified for the protein, substitution 

25 analogs wherein one or more residues specified are 

replaced by other residues and addition analogs where in 
one or more amino acid residues is added to a terminal or 
medial portion of the polypeptides) and which share some 
or all properties of naturally-occurring forms. These 

30 nucleic acids or oligonucleotides include: the 

incorporation of codons "preferred" for expression by 
selected non-mammalian hosts; the provision of sites for 
cleavage by restriction endonuclease enzymes; and the 
provision of additional initial, terminal or intermediate 

35 DNA sequences that facilitate construction of readily 
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expressed vectors. 



10 



The nucleic acids and oligonucleotides described and 
claimed herein are useful for the information which they 
provide concerning the amino acid sequence of the 
polypeptide and as products for the large scale synthesis 
of the polypeptide by a variety of recombinant 
techniques. The molecule is useful for generating new 
cloning and expression vectors, transformed and 
transfected prokaryotic and eukaryotic host cells, and 
new and useful methods for cultured growth of such host 
cells capable of expression of the polypeptide and 
related products. 

15 An isolated nucleic acid which encodes an acylcoenzyme A: 

cholesterol acyltransf erase II. This isolated nucleic 
acid may be DNA or RNA, specifically cDNA or genomic DNA. 
Specifically, the isolated nucleic acid has the sequence 
designated Seq. I.D. No.: 7. The isolated nucleic acid 

20 encodes a human wildtype acylcoenzyme A: cholesterol 

acyltransferase II having substantially the same amino 
acid sequence as the sequence designated Seq. I.D. No.: 
8. Specifically the isolated nucleic acid has the 
sequence designated Seq. I.D. No.: 11. The isolated 

25 nucleic acid encodes a mouse wildtype acylcoenzyme A: 

cholesterol acyltransferase II having substantially the 
same amino acid sequence as the sequence designated Seq. 
I.D. No.: 12. Further, the isolated nucleic acid of 
encodes a mutant acylcoenzyme A: cholesterol 

30 acyltransferase II. 



An isolated nucleic acid which encodes an acylcoenzyme A: 
cholesterol acyltransferase III. This isolated nucleic 
acid may be DNA or RNA, specifically cDNA or genomic DNA. 
Specifically, the isolated nucleic acid has the sequence 
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as set forth in Fig. 16. The isolated nucleic acid 
encodes a human wildtype acylcoenzyme A: cholesterol 
acyltransf erase III having substantially the same amino 
acid sequence as set forth in Fig. 16. Further, the 
5 isolated nucleic acid of encodes a mutant acylcoenzyme A: 

cholesterol acyltransf erase III. 



As used in this application, "acylcoenzyme A: cholesterol 
acyltransf erase III" means and includes any polypeptide 

10 having acylcoenzyme A: cholesterol acyltransf erase III 

activity and having an amino acid sequence homologous to 
the amino acid sequence of human acylcoenzyme A: 
cholesterol acyltransf erase II (the sequence of which is 
set forth in Fig. 15) . Thus, this term includes any such 

15 polypeptide whether naturally occurring and obtained by 

purification from natural sources or non-naturally 
occurring and obtained synthetically/ e.g. by recombinant 
DNA procedures. Moreover, the term includes any such 
polypeptide whether its sequence is substantially the 

20 same as, or identical to the sequence of any mammalian 

homolog of the human polypeptide, e.g. murine, bovine, 
porcine, etc. homologs. Additionally, the term includes 
mutants or other variants of any of the foregoing which 
retain at least some of the enzymatic activity of 

2 5 nonmutants or nonvariants . 

As used in this application/ "acylcoenzyme A: cholesterol 
acyltransf erase II" means and includes any polypeptide 
having acylcoenzyme A: cholesterol acyltransf erase II 

30 activity and having an amino acid sequence homologous to 

the amino acid sequence of human acylcoenzyme A: 
cholesterol acyltransf erase III (the sequence of which is 
set forth in Fig. 16) . Thus, this term includes any such 
polypeptide whether naturally occurring and obtained by 

35 purification from natural sources or non-naturally 
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occurring and obtained synthetically, e.g. by recombinant 
DNA procedures. Moreover, the term includes any such 
polypeptide whether its sequence is substantially the 
same as, or identical to the sequence of any mammalian 
homolog of the human polypeptide, e.g. murine, bovine, 
porcine, etc. homologs. Additionally, the term includes 
mutants or other variants of any of the foregoing which 
retain at least some of the enzymatic activity of 
nonmutants or nonvariants. 

The invention also encompasses DNAs and cDNAs which 
encode amino acid sequences which differ from those of 
acylcoenzyme A: cholesterol acyltransf erase II, but which 
do not produce phenotypic changes. 

The invention also encompasses DNAs and cDNAs which 
encode amino acid sequences which differ from those of 
acylcoenzyme A: cholesterol acyltransf erase III, but 
which do not produce phenotypic changes. 

The nucleic acid of the subject invention also include 
nucleic acids that encode for polypeptide analogs, 
fragments or derivatives which differ from naturally- 
occurring forms in terms of the identity or location of 
one or more amino acid residues (including deletion 
analogs containing less than all of the residues 
specified for the protein, substitution analogs wherein 
one or more residues specified are replaced by other 
residues and addition analogs wherein one or more amino 
acid residues is added to a terminal or medial portion of 
the polypeptides) and which share some or all properties 
of the naturally-occurring forms. 



35 



The polypeptide of the subject invention also includes 
analogs, fragments or derivatives which differ from 
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naturally-occurring forms, but having acylcoenzyme A: 
cholesterol acyl transferase activity. 

This invention also provides a vector comprising an 
5 isolated nucleic acid encoding acylcoenzyme A: 

cholesterol acyltransf erase II or III. The isolated 
nucleic acid of the vectors is operatively linked to a 
promoter of RNA transcription which maybe, or is 
identical to, a bacterial, yeast, insect or mammalian 
10 promoter. The vector may be a plasmid, cosmid, yeast 

artificial chromosome (YAC) , bacteriophage or eukaryotic 
viral DNA. Specifically, this invention provides a 
vector designated Y e p AB - AC AT 2 (Figure 6) . 

15 Further other numerous vector backbones known in the art 

as useful for expressing proteins may be employed. Such 
vectors include but are not limited to: adenovirus, 
simian virus 40 (SV40) , cytomegalovirus (CMV) , mouse 
mammary tumor virus (MMTV) , Moloney murine leukemia 

20 virus, murine sarcoma virus, and Rous sarcoma virus, DNA 

delivery systems, i.e liposomes, and expression plasmid 
delivery systems. 



This invention also provides a vector system for the 
25 production of a polypeptide which comprises the vector in 

a suitable host. Suitable host includes a cell which 
includes, but is not limited, prokaryotic or eukaryotic 
cells, e.g. bacterial cells (including gram positive 
cells) , yeast cells, fungal cells, insect cells and 
30 animal cells. 



Suitable animal cells include, but are not limited to, 
HeLa cells, Cos cells, CV1 cells and various primary 
mammalian cells. Numerous mammalian cells may be used as 
35 hosts, including, but not limited to, the mouse 
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fibroblast cell NIH 3T3, CHO cells, Ltk" cells, etc. 
Expression plasmids such as that described supra may be 
used to transfect mammalian cells by methods well known 
in the art such as calcium phosphate precipitation, 
5 electroporation . 

This invention also provides a method for producing a 
polypeptide (e.g. acylcoenzyme A: cholesterol 
acyltransferase) which comprises growing a host vector 

0 system under suitable conditions permitting production of 

the polypeptide and recovering the polypeptide so 
produced. Methods of recovering polypeptides produced in 
such host vector systems are well-known in the art and 
typically include steps involving cell lysis, 

5 solubilization and chromatography. 

This invention also provides a method of obtaining a 
polypeptide in purified form which comprises : (a) 
introducing a vector, as described above, into a suitable 

3 host cell; (b) culturing the resulting cell so as to 

produce the polypeptide; (c) recovering the polypeptide 
produced in step (b) ; and (d) purifying the polypeptide 
so recovered. As discussed above the vector may include 
a plasmid, cosmid, yeast artificial chromosome, 

3 bacteriophage or eukaryotic viral DNA. Also, the host 

cell may be a bacterial cell (including gram positive 
cells), yeast cell, fungal cell, insect cell or animal 
cell. Suitable animals cells include, but are not 
limited to HeLa cells, Cos Cells, CV1 cells and various 

> primary mammalian cells. Culturing methods useful for 

permitting transformed or transfected host cells to 
produce polypeptides are well known in the art as are the 
methods for recovering polypeptides from such cells and 
for purifying them. 
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Using the aforementioned method, this invention also 
provides a purified wildtype acylcoenzyme A: cholesterol 
acyltransf erase II or III and a purified mutant 
acylcoenzyme A: cholesterol acyltransf erase II or III. 

5 

This invention also provides an oligonucleotide of at 
least 15 nucleotides capable of specifically hybridizing 
with a unique sequence of nucleotides present within a 
nucleic acid which encodes a wildtype acylcoenzyme A: 

10 cholesterol acyltransf erase II or III without hybridizing 

to a nucleic acid which encodes a mutant acylcoenzyme A: 
cholesterol acyltransf erase II or III. Further, this 
invention also provides an oligonucleotide of at least 15 
nucleotides capable of specifically hybridizing with a 

15 unique sequence of nucleotides present within the nucleic 

acid which encodes a mutant acylcoenzyme A: cholesterol 
acyltransf erase II or III without hybridizing to a 
nucleic acid which encodes a wildtype acylcoenzyme A: 
cholesterol acyltransf erase II or III. These 

20 oligonucleotide DNA or RNA. Such oligonucleotides may be 

used in accordance with well known standard methods for 
known purposes, for example, to detect the presence in a 
sample of DNA which will hybridize thereto. 

25 The oligonucleotides include, but are not limited to, 

oligonucleotides that hybridize to mRNA encoding 
acylcoenzyme A: cholesterol acyltransf erase II or III so 
as to prevent translation of the protein, 

30 This invention also provides a nucleic acid having a 

sequence complementary to the sequence of the isolated 
nucleic acid which encodes acylcoenzyme A: cholesterol 
acyltransf erase II or III. 

35 This invention also provides a method for determining 
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whether a subject known to have an imbalance in sterol 
levels has the imbalance due to a defect in 
esterif ication of sterol which comprises (a) obtaining 
from the subject an appropriate sample containing a 
5 mixture of all of the subject's nucleic acids; and (b) 

determining whether any nucleic acid in the sample from 
step (a) is, or is derived from, a nucleic acid which 
encodes a mutant acylcoenzyme A: cholesterol 
acyltransf erase so as to thereby determine whether the 
10 subject's imbalance in sterol levels is due to a defect 

in esterif ication of sterol. The determination step (b) 
may comprises: (I) contacting the sample of step (a) with 
the isolated nucleic acid which encodes acylcoenzyme A: 
cholesterol acyltransf erase II or III or the 

15 oligonucleotide of at least 15 nucleotides capable of 

specifically hybridizing with a unique sequence of 
nucleotides present within a nucleic acid which encodes 
a wildtype acylcoenzyme A: cholesterol acyltransf erase II 
or III without hybridizing to a nucleic acid which 

20 encodes a mutant acylcoenzyme A: cholesterol 

acyltransferase II or III under conditions permitting 
binding of any nucleic acid in the sample which is, or is 
derived from, a nucleic acid which encodes a mutant 
acylcoenzyme A: cholesterol acyltransferase to the 

25 nucleic acid or oligonucleotide so as to form a complex; 

(ii) isolating the complex so formed; and (iii) 
identifying the nucleic acid in the isolated complex so 
as to thereby determine whether any nucleic acid in the 
sample contains a nucleic acid which is, or is derived 

30 from, a nucleic acid which encodes a mutant acylcoenzyme 

A: cholesterol acyltransferase II or III. In this 

method, both the isolation of any complex formed are 
effected using standard methods well known in the art. 

35 In order to facilitate identification of the nucleic acid 
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from step (a) the isolated nucleic acid or the 
oligonucleotide is labeled with a detectable marker. The 
detectable marker may be a radioactive isotope, a 
fluorophore or an enzyme. In additions, the nucleic acid 
5 sample may be bound to a solid matrix before performing 

step (I) . 

This invention also provides a method for treating a 
subject who has an imbalance in sterol levels due to a 

10 defect in esterif ication of sterol which comprises 

introducing an isolated nucleic acid which encodes a 
wildtype acylcoenzyme A: cholesterol acyltransf erase II 
or III into the subject under conditions such that the 
nucleic acid expresses a wildtype acylcoenzyme A: 

15 cholesterol acyltransf erase II or III, so as to thereby 

treat the subject. 

This invention also provides a method for inhibiting 
wildtype acylcoenzyme A: cholesterol acyltransf erase II 

20 or III in a subject which comprises transforming 

appropriate cells from the subject with a vector which 
expresses the nucleic acid complementary to the isolated 
nucleic acid which encodes a wildtype acylcoenzyme A: 
cholesterol acyltransf erase II or III, and introducing 

25 the transformed cells into the subject so as to thereby 

inhibit wildtype acylcoenzyme A: cholesterol 
acyltransf erase II or III. Further, in a preferred 
embodiment, the nucleic acid is capable of specifically 
hybridizing to a mRNA molecule encoding acylcoenzyme A: 

30 cholesterol acyltransf erase II or III so as to prevent 

translation of the mRNA molecule. 

This invention also provides a method for inhibiting the 
wildtype acylcoenzyme A: cholesterol acyltransf erase II 
35 or III in a subject which comprises introducing an 
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oligonucleotide of at least 15 nucleotides capable of 
specifically hybridizing with a unique sequence of 
nucleotides present within a nucleic acid which encodes 
a wildtype acylcoenzyme A: cholesterol acyl trans f erase II 
or III without hybridizing to a nucleic acid which 
encodes a mutant acylcoenzyme A: cholesterol 
acyltransferase II or III into the subject so as to 
thereby inhibit the wildtype acylcoenzyme A: cholesterol 
acyltransferase II or in. The oligonucleotide is 
capable of specifically hybridizing to a mRNA molecule 
encoding acylcoenzyme A:, cholesterol acyltransferase II 
or III so as to prevent translation of the mRNA molecule. 

This invention also provides for a method for identifying 
a chemical compound which is capable of inhibiting 
acylcoenzyme A: cholesterol acyltransferase II or III in 
a subject which comprises (a) contacting a wildtype 
acylcoenzyme A: cholesterol acyltransferase II or III 
with the chemical compound under conditions permitting 
binding between the acylcoenzyme and the chemical 
compound (b) detecting specific binding of the chemical 
compound to the acylcoenzyme; and © determining whether 
the chemical compound inhibits the activity of the 
coenzyme so as to identify a chemical compound which is 
capable of inhibiting acylcoenzyme A: cholesterol 
acyltransferase II or III in a subject. 

This invention also provides method for differentially 
inhibiting one acylcoenzyme A: cholesterol 
acyltransferase but not others using the above methods . 
In an embodiment, only acylcoenzyme A; cholesterol 
acyltransferase I is inhibited. In another embodiment 
only acylcoenzyme A: cholesterol acyltransferase II 
(ARGP1) is inhibited. In an another embodiment only 
acylcoenzyme A: cholesterol acyltransferase III (ARGP2 ) 
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is inhibited. Alternatively, two of the acylcoenzyme A: 
cholesterol acyltrans f erases may be inhibited. This 
invention further provides pharmaceutical compositions 
which will differentially inhibit one or more 
5 acylcoenzyme A: cholesterol acyltransf erases . 

This invention also provides for a pharmaceutical 
composition comprising the chemical compound identified 
by the above-described method in an amount effective to 
10 inhibit acylcoenzyme A: cholesterol acyltransf erase II or 

III in a subject and a pharmaceutical^ effective 
carrier . 



This invention also provides a method of treating a 
15 subject who has atherosclerosis comprising the above- 

described pharmaceutical composition. A method of 
treating a subject who has hyperlipidemia comprising the 
above-described pharmaceutical composition. 

20 This invention also provides a transgenic, nonhuman 

mammal comprising the isolated nucleic acid which encodes 
acylcoenzyme A: cholesterol acyltransf erase II or III. 
The mammal includes, but is not limited to, a mouse, 
bovine, cat or dog. 

25 

This invention is illustrated in the Experimental Details 
section which follows. These sections are set forth to 
aid in an understanding of the invention but are not 
intended to, and should not be construed to, limit in any 
30 way the invention as set forth in the claims which follow 

thereafter . 
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Experimental Details 

First Series of Experiments 

Example 1 : 

5 Materia ls and Methods: 

Transformation of yeast was performed with lithium 
acetate (15) by amino-acid prototrophy selection. A 
diploid strain (5051) was constructed between two 

0 isogenic derivatives of W303 (16); W1346-3C (MATa. , 
ade2-l, canl-100, his3-ll, 15, leu2-3, 112, trpl-1, 
ura3-l) and W1134-2C (MATsl, canl-100, his3-ll, 15, 
leu2-3, 112, trpl-1, ura3-l, metl4DHpaI-SalI) . Growth on 
complete (YEPD) or synthetic medium, sporulation and 

5 dissection was performed as described ( 17 ) . 

Competent cells of Escherichia coli strain DH5a 
(Gibco-BRL) and DNA modifying enzymes (Promega) were used 
according to the manufacturers instructions. pH3(34), 

3 from L.A. Grivell, was digested with Nhe I, blunt-ended 

with Klenow sequences, and digested with Avr II to 
liberate a 1614-bp fragment. An Xba I, Sma I fragment of 
pJH-Hl encoding the HIS3 gene was then inserted at these 
sites in the vector backbone to produce the arelANA 

> allele. This construct was digested with Bsa I to 

liberate a 3821-bp fragment which was then transformed 
into strain 5051. Disruption of ARE1 was confirmed by 
Southern blot analysis. 

1 Radioactive probes of acylcoenzyme A: cholesterol 
acyltransf erase-related enzyme I were prepared by random 
priming (Pharmacia) with "P-dCTP. Genomic DNA (18) was 
transferred to Hybond membranes (Amersham) and hybridized 
in the absence of formamide at 65° or 60°C (19) . 
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A shotgun library of cosmici 14-21 from chromosome XIV 
(Peter Philippsen, Biozentrum Basel) was constructed 
using the nebulizing technique (20) . The DNA was 
nebulized (90 seconds, 2 bars), size fractionated, 
5 treated with DNA polymerase I (Klenow fragment) and T4 

DNA polymerase and blunt-end ligated into pTZ18R 
(Pharmazia, Germany) . Nucleotide sequencing was 

performed by dideoxy-chain- termination with 

digoxigenin-labeled reverse primer and Sequenase (United 

10 States Biochemical) . The reactions were analyzed on the 

GATC 1500 direct blotting electrophoresis system (GATC 
GmbH, Germany) using the Boehringer-Mannheim 
Dig-development protocol. Sequences were aligned by 
SeqMan (DNA Star Inc.). Database searching was performed 

15 with BLAST (21) and GCG Inc. software (22). The DNA 

sequence of the acylcoenzyme A: cholesterol 
acyl trans ferase-related enzyme I and acylcoenzyme A: 
cholesterol acyltransferase-related enzyme II genes are 
deposited at GenBank (P25628 and U51790, respectively) . 

20 

KO-5 and K 0 - 3 ' primers 

( GAGG G G AC G AAAAT TAG C C G C TAT T AAT T C TGG T AT T GC C AC C TAG ACAAGAAG 
TAAACAGACACAGATGcaagagttcgaatctcttagc (Sequence ID No.: 
15) and CTATAAAGATTTAATAGCTCCACAGAACAGTTGCAGGATGCCTTAGGGT 

25 CGActacgtcgtaaggccgtttctgac (Sequence ID No.: 16), 

respectively; lower case corresponds to the LEU 2 gene) 
were used in a PCR with the LEU2 gene as a template to 
produce the selectable yeast gene flanked by acylcoenzyme 
A: cholesterol acyltransferase-related enzyme II gene 

30 sequences (23) . This was used to transform a derivative 

of yeast strain 5051, heterozygous for the arelANA 
allele. To identify integrants at the acylcoenzyme A: 
cholesterol acyltransferase-related enzyme II locus, a 
PCR was performed on genomic DNA from these strains using 
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are2-5' (CATTGCAGTTACACGTGAATGC) (Sequence ID No.: 17), 
are2-3' : (TAGCTCCACAGAACAGTTGCAGG) (Sequence ID No.: 18) 
and a 3' amplimer corresponding to the LEU2 gene (L2-3' 
C T C TGACAACAAC GAAG T C AG ) (Sequence ID No.: 19) . 



10 



1-2 units at an absorbance of 6000nm of cells were 
incubated in YEPD or defined media containing 1 /uCi/ml 
3 H-oleate in tyloxapol/ethanol (1:1) for 16 hours. Total 
lipids were prepared by hexane extraction (25) and 
analysed by thin layer chromatography on DC-plastikf olien 
kieselgel 60 plates (E-Merck, Germany) . The plate was 
developed in hexane, diethyl ether and acetic acid 
(70:30:1) and stained with iodine vapor. Incorporation 
15 of label into triglyceride and ergosterol ester was 

ascertained following scintillation counting and 
normalization to a i4 C-cholesterol internal standard and 
the dry weight of the cells. 

20 To overexpress the acylcoenzyme A: cholesterol 

acyltransferase-related enzyme I gene by copy number 
under the control of its own promoter in YEp3-16, a 2354 
bp Cla I fragment from pH3(34), encompassing the entire 
acylcoenzyme A: cholesterol acyltransferase-related 

25 enzyme I gene, was made blunt-ended with Klenow DNA 

polymerase I and introduced into the Sma I site of 
YEp352. To constitutively overexpress acylcoenzyme A: 
cholesterol acyltransferase-related enzyme I from the ADH 
promoter in pADH5-36, a 2290 bp Nar I fragment of 

30 pH3(34), starting 70 bp 5' to the ORF was blunt-ended 

with Klenow and ligated to Klenow-treated, Eco RI 
digested, pDC-ADH (a derivative of pS5) (26) . Increased 
expression of the acylcoenzyme A: cholesterol 
acyltransferase-related enzyme I transcripts, relative to 

35 a wild-type cell, was confirmed by northern blot 
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analysis . 

The incorporation of [1- M C] acetate into saponified 
lipids was assessed as a measurement of sterol synthesis. 
5 Approximately 2 OD C00 units of cells were incubated with 

20 fxCi [1- U C] acetate in 2 ml defined media at 30°C for 
3 hours and subjected to lipid saponification, hexane 
extraction and TLC chromatography (29) . The 
incorporation of counts into total sterols were assessed 
10 following scintillation counting. To normalize the 

estimate of sterol biosynthesis to incorporation of 
acetate into the fatty acid pool, the aqueous lysate 
remaining after hexane extraction was acidified with 
concentrated HC1 and re-extracted with hexane (30) . 



15 



Experimental Discussion 



To use yeast genetics to study sterol esterif ication, the 
human acylcoenzyme A: cholesterol acyl transferase 

20 sequence was used to search for homologous yeast genes 

and subsequently to identify an additional human isoform 
(Figures 1A and IB). Acylcoenzyme A: cholesterol 
acyl transferase related enzyme I, an 1830-bp open reading 
frame (ORF) on yeast chromosome III, encodes a 610- 

25 residue protein with 23% identity and 49% similarity to 

human acylcoenzyme A: cholesterol acyltransf erasel 
(Figures 1A and IB) . The yeast and human proteins 
possess leucine zipper motifs that could mediate 
protein-protein interactions (esterif ication is probably 

30 performed by a multimeric complex) (12), and possess at 

least two predicted transmembrane domains that may 
mediate the membrane association of the acylcoenzyme A: 
cholesterol acyltransf erase reaction (13, 14) . 

35 To define the role of acylcoenzyme A: cholesterol 
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acyltransf erase-related enzyme I in sterol 

esterif ication, the deletion mutant, are 1 ANA, was 
generated by homologous recombination (15, 16, 17) (Fig. 
2A) . In a diploid strain, a 1614-bp segment of one 
5 acylcoenzyme A: cholesterol acyltransf erase-related 

enzyme I allele was replaced with the HIS3 gene and 
confirmed by Southern hybridization (Fig. 2B) . Analysis 
of mutant and wild-type haploid progeny from this diploid 
indicated no differences in growth rates or incorporation 
10 of 3 H-oleate into ergosterol ester. 

The lack of a defect in sterol esterif ication in arelANA 
strains could result from alternate esterif ication 
activities. Reduced stringency hybridization of yeast 

15 genomic DNA with the acylcoenzyme A: cholesterol 

acyltransferase-related enzyme I coding sequence as a 
probe indicated that additional homologous sequences were 
present (18, 19) . A Bam HI digestion of genomic DNA 
produced the predicted 2.9-kb acylcoenzyme A: cholesterol 

20 acyltransferase-related enzyme I fragment and a -6.0-kb 

hybridizing fragment (Fig. 2C) . Contour clamped 

homogeneous electric field electrophoretic analysis of 
yeast chromosomes suggested the latter sequence was 
localized to chomosome X or XIV. On the basis of 

25 homology to acylcoenzyme A: cholesterol acyltransferase- 

related enzyme I, this gene, designated acylcoenzyme A: 
cholesterol acyltransferase-related enzyme II, encodes a 
second yeast homolog to human acylcoenzyme A: cholesterol 
acyltransf erasel (Figures 1A and IB) . The genomic 

30 sequence (20, 21, 22) encompassing acylcoenzyme A: 

cholesterol acyltransferase-related enzyme II on 
chromosome XIV predicts a 5997-bp Bam HI fragment and a 
1929-bp ORF, which translates into a 643-residue 
polypeptide. The yeast acylcoenzyme A: cholesterol 
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acyltransf erase related enzymes genes are 61% and 49% 
identical at the DNA and predicted protein levels, 
respectively. Arelp, Are2p and the human acylcoenzyme A: 
cholesterol acyltransf erasel protein are most related at 
5 the COOH-terminal region (42% identity over a 90-residue 

sequence) (Figures 1A and IB) . 

To assess the contribution of Are2p to sterol 
esterif ication, one copy of the acylcoenzyme A: 
cholesterol acyltransf erase-related enzyme II coding 
sequence was deleted from the genome of an AREl/arelANA 
heterozygous diploid by a polymerase chain reaction 
approach (23) (Fig. 2D). Haploid progeny representing 
the single arelANA and are2A deletions and the arelA 
NAare2A double mutant were obtained. To ascertain the 
effect of deletion of acylcoenzyme A: cholesterol 
acytransf erase-related enzymes genes upon cytoplasmic 
lipid storage, the neutral lipid components (triglyceride 
and sterol ester) of the yeast cells were detected by 
fluorescence microscopy after staining with Nile Red 
(24) . In wild-type cells, cytoplasmic fluorescent 
droplets accumulated in stationary phase cultures (Fig. 
3A) . No differences in are single mutants were detected. 
However, the number of droplets observed in arelANAare2A 
double mutants, was one-third to that in wild-type 
strains (Fig. 3B; over multiple fields, 5,57 ± 2.73 vs. 
16.73 ± 4.6 droplets/cell, P<0.05). 

The wild-type and are mutant cells were analyzed for the 
30 incorporation of 3 H-oleate into sterol ester- (25) (Fig. 

4B) . No significant differences in triglyceride 

biosynthesis were detected. In contrast to normal sterol 
ester biosynthesis observed in arelANA mutants, 
deficiencies in sterol esterif ication were apparent in 
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both are2A and arelANAare2A mutants. These were detected 
by iodine vapor staining of thin layer chromatographs of 
total yeast lipids in addition to the oleate 
incorporation assays. Sterol ester levels of are2A 
single mutants were reduced to less than 26% of wild-type 
suggesting the acylcoenzyme A: cholesterol 
acyltransferase-related enzyme II isoform to confer the 
majority of acyltransf erase activity. The arelANAare2A 
double mutant was almost totally deficient in sterol 
esterif ication (less than 1% of wild-type levels) . In 
confirmation of the critical role of Are proteins in 
sterol esterif ication, microsomes from double mutant 
yeast cells lacked acylcoenzyme A: cholesterol 
acyltransf erase activity when assayed in vitro. 



To confirm that the protein encoded by an acylcoenzyme A: 
cholesterol acyltransferase-related enzymes ORF was 
sufficient for sterol esterif ication, the acylcoenzyme A: 
cholesterol acyltransferase-related enzyme I coding 
sequence was over-expressed in vectors with increased 
copy number (YEp3-16) or elevated transcription (the 
alcohol dehydrogenase promoter in pADH5-36) (26). There 
were no detectable changes in triglyceride or 
phospholipid biosynthesis resulting from acylcoenzyme A: 
25 cholesterol acyltransferase-related enzyme I 

over-expression. In are2A or arelANAare2A double 
mutants, acylcoenzyme A: cholesterol acyltransferase- 
related enzyme I over-expression complemented the sterol 
esterification defect (Fig. 4C) . In wild-type and 
arelDNA single mutants, the high level expression of 
acylcoenzyme A: cholesterol acyltransferase-related 
enzyme I did not elevate sterol ester synthesis above 
untrans formed controls. This suggests that either 
substrates are limiting in acylcoenzyme A: cholesterol 
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acyltransf erase-related enzymes strains or that the 
enzyme is post- translationally regulated as in mammalian 
cells (27) . 

5 An accumulation of unesterified sterol in cell membranes 

would likely be deleterious (28) . However, despite the 
major changes in sterol esterif ication conferred by the 
are mutants, we did not detect any reduction in growth 
rates. The established role of sterol esteri f ication in 

10 the storage of sterol suggests that an inability to 

esterify sterol could lead to homeostatic changes in 
sterol biosynthesis- This relationship might account for 
the viability of the mutants. Total lipids, labelled by 
the incorporation of [1- J4 C] acetate into exponentially 

15 growing cells (29, 30) , were saponified and extracted. 

The arel ANaare2A double mutants had a two to three- fold 
lower level of sterol biosynthesis than wild-type cells, 
although no changes were observed in the single mutants 
(Fig. 4D) . In fact, free sterol concentrations were 

20 roughly equivalent in all cells. Feedback regulation of 

sterol biosynthesis by acylcoenzyme A: cholesterol 
acyltransf erase activity has been observed in mammalian 
cells (31) and may. be a common mechanism that maintains 
intracellular sterol at non-toxic concentrations. 

25 

The involvement of multiple gene families in sterol 
homeostasis is common in mammalian and yeast cells, for 
example, the LDL receptor related protein and scavenger 
receptor gene families, the SREBP family, and 3-hydroxy- 
30 3-methyl-glutanyl-CoA reductase) (4, 32, 33, 34) . This 

apparent redundancy of function has clear physiological 
consequences as evidenced by deletion of any one of the 
family members. The observation here of two yeast genes 
for sterol esterif ication provoked the hypothesis of 
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similar redundancy for this reaction in humans. To this 
end, a consensus of the yeast acylcoenzyme A: cholesterol 
acyltransf erase-related enzymes and human acylcoenzyme A: 
cholesterol acyltransf erasel sequences was used to 
5 identify an additional cDNA with significant identity 

(47%) to human acylcoenzyme A: cholesterol 
acyltransferasel and the yeast proteins (Figure IB, 
Genbank accession # R07932) . 

10 Sterol homeostasis is a complex event under subtle 

regulatory controls, one component of which is sterol 
esterif ication. The demonstration here of multiple yeast 
and human acylcoenzyme A: cholesterol acyltransf erase 
isoforms raises the possibility that in vivo, the enzymes 

15 exhibit alternate substrate preferences. The analysis of 

esterif ication reactions in yeast is likely to impact the 
understanding of sterol homeostasis and atherosclerosis 
in humans . 

2 0 Example 2: 



Tissue specific expression of acylcoenyme A: cholesterol 
acyltransf erase II was analyzed by Northern blot RNA 
hybridization of RNA obtained from the described tissues. 

25 Using the same materials and procedures of Chang, et al . 

(10), the specific expression of acylcoenyme A: 
cholesterol acyltransf erase II in liver and muscle is 
documents, in contrast to similar experiments using the 
previously known acylcoenyme A: cholesterol 

30 acyltransferase I (10) (Figures 7A and 7B) . Acylcoenyme 

A: cholesterol acyltransferase II was also detected and 
specifically expressed in adrenal, thyroid and testicular 
tissues . 



35 
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Example 3: 

After determining the consensus sequence between the two 
yeast gene and the previously known human acylcoenzyme A: 
5 cholesterol acyltransf erase, the consensus sequence was 

compared to sequences deposited in Genbank. The clones 
containing the sequences that showed similarity to the 
consensus sequence were ordered from the I.M.A.G.E. 
Consortium, affiliated with Research Genetics, Inc., 2130 

10 Memorial Parkway S.W. Huntsville, Alabama 35801. Clones 

deposited with the I.M.A.G.E, consortium are publicly 
available upon request. A particular clone, Genbank ID 
clone No. Z39933 was chosen. This clone contains a cDNA 
fragment whose sequence encodes human acylcoenyme A: 

15 cholesterol acyltransf erase II. The fragment was cut out 

with restrictions enzymes Bgl II and Not I. The 
resulting fragment was introduced into the yeast 
expression vector pRS426 at Bgl II and Not I sites 
downstream of the yeast promoter (GAL1/GA110) which is 

20 regulated by carbon sources. The resultant vector was 

designated Y e p AB -AC AT 2 (Figure 6) . 

Example 4 : 

25 Antisense RNA technology can be used to create mice, or 

mouse or human cell lines incapable of translating 
acylcoenzyme A: cholesterol acyltransf erase II RNA into 
protein. Standard methods may be used to create an 
antisense oligonucleotide to the human homolog of 

30 acylcoenzyme A: cholesterol acyltransf erase II These 

methods are well known in the art (3 6) . 

Specifically, part or all of a wildtype acylcoenzyme A: 
cholesterol acyltransf erase II is ligated adjacent to a 
35 mammalian promoter in the opposite orientation. The 
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promoter and other replicatory mechanisms inside the cell 
will transcribe a human homolog of acylcoenzyme A: 
cholesterol acyltransf erase II encoding, nonsense strand. 
This strand will bind with the coding mRNA which is 
5 normally synthesized to form a complex. Due to the 

formation of this complex, the antisense strand prevents 
the translation of the coding mRNA into protein. 

Further, one skilled in the art can synthesize an 
10 oligonucleotide in vitro which is capable of binding the 

mRNA that encodes a human homolog of acylcoenzyme A: 
cholesterol acyltransf erase II so as to inhibit the 
translation of the mRNA into . protein. The 
oligonucleotides can then be introduced into the subject 
15 using a pharmaceutical^ acceptable carrier. Methods of 

synthesizing naturally and non-naturally occurring 
oligonucleotides which are capable of inhibiting the 
translation of the mRNA into protein are well known in 
the art. Also, means of transfecting an organism with 
20 such oligonucleotides are well known in the field. 

Example 5 : 

Mice can be made with an alteration in their genome, 
25 specifically at the acylcoenyme A: cholesterol 

acyltransferase II gene site. Standard methods may be 
used to alter the genome. These methods are well known 
in the art (37, 38) . 



3 0 One such process to achieve this goal involves disrupting 

the wildtype mouse homolog of acylcoenyme A: cholesterol 
acyltransferase II in vitro, then introducing the altered 
gene into mouse embryonal stem cells in such a way as to 
taret integration into the corresponding genomic region. 

35 This process can be performed such that both copies of 
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the wildtype acylcoenyme A: cholesterol acyltrans f erase 
II are replaced by the altered, knock-out version. These 
modified cells can be introduced into blastocysts which 
will be allowed to develop into chimeric adults. Mice 
5 bearing the altered acylcoenyme A; cholesterol 

acyltransf erase II gene will be mated to each other to 
generate homozygous mutant acylcoenyme A: cholesterol 
acyltransf erase II animals. 

10 Further, one can breed two mice who are heterozygous for 

mutant acylcoenzyme A: cholesterol acyltransf erase II. 
From their progeny, one skilled in the art could select 
the progeny who are homozygous for mutant acylcoenzyme A: 
cholesterol acyltransf erase II. Breeding and selecting 

15 such progeny are well known in the art. 
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S cond Series of Experiments 

The efficient regulation of intracellular sterol levels 
is required for cell viability by all eukaryotic 
5 organisms. When this regulation is aberrant in cells of 

the arterial wall, disease states such as atherosclerosis 
ensue. A critical component of this homeostasis is 
intracellular sterol esterif ication reaction, mediated by 
the enzyme, acyl coenzyme A-cholesterol acyltransf erase 

10 ( ACAT ) . In the model eukaryote, yeast, this laboratory 

has demonstrated that sterol esterif ication is mediated 
by a two gene family (Yang et al . , Science . 1996, 
272:1353). The existence in human cells of two 
additional genes encoding ACAT related enzymes are 

15 demonstrated. These protein are termed ACAT related 

gene products (ARGP) l and 2, also known as acylcoenzyme 
A: cholesterol acyltransf erase II and acylcoenzyme A: 
cholesterol acyltransf erase III respectively. The ARGP s 
exhibit marked sequence conservation to the human ACAT 

20 sequence (hACAT) originally identified by Chang and 

colleagues. ARGP1 is expressed at high levels in 
intestine and liver in contrast to the expression of 
hACAT which is of low abundance in these tissues. The 
observation that knock-out mutant mice deficient in the 

25 murine homolog if hACAT retain sterol esterif ication 

activity in liver and intestine (Meiner et al . , PNAS, 
1996, 93:14041), suggests that ARGP1 is a candidate for 
sterol esterif ication in these tissues. The expression 
of ARGP2, by contrast, seems to be restricted to the 

30 fetal liver, suggesting it to have a role in lipid 

metabolism during development. Analysis of genome 
databases indicates that ACAT- like gene families are a 
common occurrence in multiple organisms. It is 

hypothesize that multiple enzymes for sterol 

35 esterif ication will provide flexibility in response to 

differing sterol and fatty acid substrates encountered by 
different tissues. This further suggests specific roles 
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for these enzymes in lipoprotein production, lipid 
homeostasis, and disease progression. 

The regulation of membrane sterol levels is required for 
5 cell viability by all eukaryotic organisms. When this 

regulation is aberrant in human cells, disease states 
such as atherosclerosis (excessive accumulation of 
cellular esterified cholesterol in cells of the arterial 
wall, reviewed in (1-4))/ Niemann Pick C {inability to 

10 store sterol correctly, resulting in lysosomal lipidosis, 

(5)) or Wollmann's disease (a defect in sterol ester 
hydrolysis, (6) ) ensue. A critical component of this 
homeostasis is the intracellular neutralization of sterol 
by an esterif ication reaction between the C 3 -0H group of 

15 cholesterol and fatty acyl-coenzyme A. This reaction is 

performed in mammalian cells by the enzyme acyl coenzyme 
A-cholesterol acyltransf erase (ACAT) . Since the process 
of sterol esterif ication converts sterol into a 
cytoplasmic storage form, it is critical to all 

2 0 eukaryote, including the microorganism Saccharomyces 

cerevisiae (budding yeast) . Analysis if sterol 

homeostasis in this model organism has the advantage that 
molecular genetics, particularly since the completion of 
the yeast genome sequencing project, is powerful and 

2 5 relatively straightforward. Taking advantage of this, it 

is demonstrated that sterol esterif ication in yeast is 
mediated by a two gene family (7) , neither of which is 
essential for life. These genes ( ARE1 and ARE 2 ; 
encoding ACAT R elated Enzymes 1 and 2, respectively) are 

30 both capable of independently esterif ying sterol, 

although in terms of contribution to the sterol ester 
mass of the cell, Arel is a minor isoform relative to 
Are2. The genes are structurally and functionally 
analogous to the ACAT sequence isolated originally from 

35 macrophages by Chang and colleagues (8) . They share 

approximately 23% identity at the protein level and 
expression of the human macrophage ACAT cDNA in yeast are 
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double deletion mutants results in esterif icat ion of 
sterol (9) . 

A critical test of the role of the ACAT gene product in 
5 cholesterol homeostasis and atherosclerosis was initiated 

by Farese and colleagues, by the production of "knock- 
outs" at the Acact locus corresponding to the mouse 
homolog of hACAT (10) . The fidelity of the mutation was 
confirmed by sequencing of cDNA from the disrupted allele 

10 and by the failure to detect immunoreactive protein in 

Acactf^cell extracts. The animals were healthy and 
fertile and had residual, but significant, sterol 
esterif ication activity in fibroblasts ahd macrophages. 
Cholesterol ester levels and ACAT activity in the 

15 adrenals were also severely reduced. Conversely, Acact^ 

livers contained significant levels of cholesterol ester, 
and esterif ication activity was not altered. 
Furthermore, sterol absorption in the intestine, a 
process that probably requires esterif ication, was 

20 unaffected by the gene disruption. These observations 

strongly suggest that as in yeast, there are multiple 
genes for the ACAT reaction in mammalian cells, probably 
with tissue specific expression patterns. 

25 Interestingly, despite the clear origin of the yeast gene 

family by gene duplication, the ARE proteins have 
diverged such that the majority of sequence conservation 
is in the COOH- terminal domain of the protein. This is 
presumably the critical region of the molecule, since it 

30 is also conserved in the human protein. Using this 

region as a database probe, R07932 (11) was identified, 
a partially sequenced cDNA entry in the database of 
expressed sequence tags (best) ; R07932 exhibits 
significant similarity to the ACATs particularly over the 

35 COOH- terminal region. Taken together; the "founder" 

sequence, the observations in yeast of a two gene family 
for sterol esterif ication, and the tissue-specific 
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expression patterns of enzyme activity in Acact^ knock-out 
mice, suggest that there are multiple genes for this 
reaction in all eukaryote. It is reported here the 
isolation and characterization of cDNAs from two human 
5 loci that encode ACAT Related Gene Products (ARGP) . 

ARGP1 is represented multiple times in the best, 
including R07932, and is expressed ubiquitously with the 
highest levels occurring in the liver, intestine and 
adrenal gland. By contrast, sequences identical to ARGP2 
10 in the databases are infrequent, consistent with the 

observation of an essentially embryonic pattern of 
expression. Analysis of genome databases indicates that 
gene families that conserve these motifs are a common 
occurrence in multiple organism. 

15 

Materials and Methods. 

Database searching for ACAT related sequences. A 
sequence corresponding to the strongest region of protein 
conservation between the human macrophage ACAT and yeast 

20 ARE sequences was used to identify protein sequences 

predicted to be encoded by entries in the best using the 
tblastn software (NCBLI) . The DNA sequences thus arising 
were used to detect additional clones in any available 
database, that demonstrated overlaps of nucleotide 

2 5 sequence identity. Databases searched included; best, 

the non-redundant GENBANK, and the confidential database 
held at The Institute of Genome Research (TIGR) . 
Overlaps between these sequences were detected using the 
sequence alignment programs, "lineup" and "pileup" from GCG 

30 Inc (Madison, WI) . A consensus sequence was then 

generated. Escherichia coli clones with the largest 
inserts corresponding to these sequences (see table 1) 
were obtained from the I.M.A.G.E. consortium and 
resequenced from both ends using commercial primers, T3 

35 and T7 , or internal primers derived from a consensus. 

Nucleotide sequencing was performed at the Columbia 
University Combined Center core facility using an Applied 
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Biosystems fluorescent sequencing machine 



Table 1 



Entries of human ACAT related gene products in 
the products in the data base of expressed 
sequence tags. 



10 



15 



20 



25 



30 



35 



Gene 



ARGP1 



ARGP2 



Clone ID 
(IMAGE) 

200587 

55218 



1881180 

78614 
153836 

106260 

128921 

213176 
245265 



GENEBANK ID 

R99213 

R99214 

C-IMF11 

243867 

Z33993 

H45923 

H45924 

M79086 

R4 84 74 

R48475 

T35085 

R10272 
R10273 
N75438 
H76642 



Insert size 
(bp) 

620 

1800 



1000 

300 
800 

800 

680 

540 
300 



Comments 



chimera 



Isolation and sequenc ing analysis of full length cDNA 

clones of ARGP 1 and ARGP2 . Since in no instance were any 
of the database clones full length for either ARGP1 or 
ARGP2 , additional clones with intact 5' -ends are 
described. Several strategies were chosen using a 
consensus nucleotide sequence derived from the sequencing 
of the best clones designed and synthesized 3 -end, gene 
specific primers and used a PCR based, rapid 
amplification of cDNA ends (RACE) to derive 5' -RACE 
reaction products from a human liver/spleen Marathon 
library (Clontech*) . Similar strategy was used to derive 
PCR products from a human fetal brain library generously 
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provided by Bento Soares (Columbia University) . In some 
instances, a nested PCT reaction was performed using 
internal gene specific primers and library adaptors. 
Finally primer extension cDNA products were identified 
5 from rnRNA extracted from human intestine (a kind gift of 

P. Dawson). Amplification products of the predicted size 
were confirmed as gene specific, using southern 
hybridization to sequences predicted to be at the 3 '-end 
of these products. The products were isolated from 

10 agarose gels using Geneclean and subcloned into TA 

variants of pBluescript (Stratagene*) vectors of 
klenow/kinase treated and blunt end ligated to pGEM2 
(Promega*) . Positive clones were identified by colony 
.hybridization or by PCR amplifications using an internal 

15 ARGP specific primer. Clones with the largest inserts 

were sequenced to obtain novel sequence and where 
necessary* this process was reiterated with ARGP 5' 
specific primers derived from the new sequence. 

20 Tissue specific expression of hACAT and ARGds . Fragments 

of the best clones R99213 and R10273 corresponding to 
ARGP1 and ARGP2 , respectively were derived by digestion 
with EcoRI and Not I , and purified from agarose gels with 
Geneclean. A 1.6 kbp fragment corresponding to the human 

25 ACAT cDNA identified by Chang et al was used as a probe 

for the expression of this gene. Radiolabelled probes 
were generated by random priming (Pharmacia®) in the 
presence of 32 -P dCTP and used to probe Multiple Tissue 
Northerns (MTN, Clontech®) of human samples. 

30 Hybridizations were performed, according to the 

manufacturers instructions, using ExpressHyb rapid 
hybridization solution for 1 hour at 78°C, followed by 
washed in 2xSSC at 55°C and O.lxSSC, 0.5%SDS at 50°C. 

35 Cell culture expression of ARGPs . To facilitate 

quantitation of rnRNA from the ARGP genes, a reverse - 
transcriptase PCR {RT-PCR) approach was devised to 
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analyze expression in a variety of human (HeoG2 , THP-1 
macrophages) and rodent (J774 macrophages) and simian 
(CVl kidney cells) . Where possible, primers were 
designed to be conserved between rodents and humans (as 
5 described below, the mouse sequence homolog to ARGP1.) has 

been identified. Alternatively, PCR conditions were 
optimized to permit moderate mismatches. The ARGP 
amplification primers were designed to be gene specific 
(i.e. to regions not conserve within the family) and to 
10 produce distinct size products. 

Experimental Results and Discussion 

The approach that the region of strongest conservation 
between the yeast ARE proteins and hACAT would be 

15 critical to the function of any sterol esterif ication 

enzymes was taken. A region of conservation (consensus; 

LN---E FGDR-FY GDWWN, single letter amino-acid code) 

that is invariant over the three proteins was chosen and 
a series of entries derived from gene sequencing projects 

20 identified. In addition to sequences from 

Caenorhabditis . elecrans, Schizosacharomuces pombe, 
Drosop hila melanocrater and Arabidopis thaliens , several 
entries in the best of human cDNAs that suggested an 
independent gene encoding an ACAT like protein were 

25 observed. Using the nucleotide sequence to this clone, 

a second homologous but distinct entry was identified. 
These proteins are termed, ACAT Related Gene Products 
(ARGP) 1 (acylcoenzyme A: cholesterol acyltransf erase II) 
and 2 (acylcoenzyme A: cholesterol acyltransf erase III) . 

30 The sequence identified by Chang et al (8) will be 

referred to as hACAT, hereon. A limited protein sequence 
to a founder clone (R07932) to ARGP1 has been presented 
previously (11) . The entries in the best that define 
these two genes, including their insert sizes are 

35 described in table 1. As is evident, the majority of 

inserts (with the exception of a chimeric clone ZA3867) 
are less than lKbp. The northern and sequence analysis 
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presented indicated them to be incomplete clones. 
However, they clearly define two distinct genes of strong 
similarity to the ACAT sequence, with the majority of 
predicted protein conservation at the COOH- terminal 
5 region. As described below certain motifs considered 

critical to sterol esterif ication are conserved. To 
identify the role of these genes in the reaction, full 
length ARGP clones were sought and their patterns 
identified, 

0 

ARGP1. a ubiquitously expressed member of the ACAT gene 
family. To establish the profile of expression of ARGP1 , 
probed multiple tissue northerns of human mRNA was 
probed, using a fragment close to the 3' end of the gene. 

5 Although this region displays the maximum conservation at 

the protein level in this gene family, the genes are 
sufficiently divergent at the DNA level to be able to 
design gene specific hybridization probes. The ARGP1 
sequence is expressed at abundant levels in may tissues 

0 with the exception of lung and kidney. The majority of 

tissues express a 2 . Okb message but, some tissues (e.g. 
adrenal, small intestine, thymus) also express a 2.4kb 
mRNA at varying levels. The same northerns were 
hybridized with a probe to the human macrophage ACAT 

5 sequence. As described by others ( 8 , 12 , 13 ) , the hACAT 

sequence detects 4 messages of approximately 3.0, 4.0, 
4.7 and 7.4Kb. Upon comparison of the two hybridization 
results, an overlapping but occasionally differential 
expression pattern was observed. Adrenal tissues express 

0 the highest levels of both hACAT and ARGP1 message . By 

this analysis, hACAT messages are rare in liver and 
intestine in contrast to ARGP1 which is highly expressed 
in these tissues. Conversely, ARGP1 was poorly expressed 
in kidney, lung and placenta although hACAT mRNA was 

5 easily detected. This tissue specific expression 

suggests that ARGP1 is an ideal candidate for sterol 
esterif ication in tissues such as liver and intestine, 
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which retain sterol esterif icat ion activity in ACAT k/o 
mice (10) . 

ARGP2 , — an embryon ic isoform of the ACAT gene family. 

5 Efforts to identify a transcript from ARGP2 in adult 

tissues were unsuccessful. Therefore embryonic tissue 
samples were chosen to investigate since the original 
founder clone was derived from a fetal liver library. A 
multiple tissue northern of rnRNA from human embryonic 

10 brain, liver, kidney, and lung, were probed with and 

ARGP2 specific, COOH- terminal probe. As shown in figure 
9, a single message of -2.2kb was identified only in 
embryonic liver tissues, suggesting a high degree of 
tissue and developmental specificity to the expression of 

15 this gene product. 

Expression of ARGP1 in cell culture models. To develop 
a system in which to test the effect of reaction 
substrates on the esterif ication reaction performed by 

20 the ARGP enzymes. The expression of these genes in 

several tissue specific were examined, cell culture 
models. As shown in figure 10, ARGP1 is clearly 
expressed in liver (HepG2) and Kidney (CV-l) cell lines. 
The latter result is somewhat in contrast to the northern 

25 blot on human tissue samples. This most likely reflects 

the sensitivity of the RT-PCR approach compared to filter 
hybridization and suggests that ARGP1 is probably 
expressed in most tissues. Alternatively it may 
represent species difference (simian vs. human) or more 

30 interestingly the differentiation status of the cells 

under study. In data not shown here, ARGPl was also 
clearly expressed in human and mouse macrophage models 
(THP-1 and J774 cells) . 

35 Sequenc e characteristics of ARGPl and ARGP 2 . By a 

combination of 5' -RACE and primer extension additional 
sequence to cDNA s for ARGPl and ARGP2 (Figs. 11 and 12) 



BNSDOCID: < WO 9745439A 1 _!_> 



WO 97/45439 




PCT/US97/09460 



-51- 

have been identified. The ARGP1 sequence predicts a 407 
amino-acid protein with approximately 27% identity and 
52% similarity to the hACAT protein (Fig. 4) . 
Interestingly, as it was observed for the yeast ARE 
proteins, the strongest conservation exists at the COOH- 
terminus of the molecules, to the extent that the NH-2- 
terminal 50% of all these proteins is essentially 
unrelated sequence. This pattern also persists at the 
DNA level (not shown) . Identification of the genomic 
sequence to these cDNAs will establish whether this 
remarkable divergence arises by exon shuffling of common 
sequences. Alternatively, convergent evolution of 

domains with conserved functions in sterol esterif ication 
or related processes, may have resulted in the generation 
of these families. Since the level of DNA conservation 
between ARGP1 and hACAT is quite low (3 7% identity) , the 
latter possibility seems likely. The conserved regions 
are discussed in the context of multiple ACAT like 
sequences below. The ARGP1 sequence predicts a protein 
of approximately 4 7kDa with multiple transmembrane 
domains in similar positions to those predicted in hACAT. 
This strongly suggests a membrane location for ARGP1 as 
would be predicted for a sterol esterif ication enzyme. 

2 5 ARGP2 displays a significantly higher level of amino acid 

conservation with hACAT than does ARGP1 . Over the 
sequence shown (Fig. 12) , the protein is 59% identical 
and 79% similar to human ACAT. Over the same region 
ARGP1 is only conserved at the level of 32% identity. 

3 0 This striking identity is maintained at the DNA level 

(62% identity) and may suggest that ARGP2 is more closely 
analogous to hACAT in both its mechanism of action and 
its origin, than is ARGP1 . As for ARGP1, certain 
hallmark sequences are retained in ARGP2 (see below) . 
35 The ARGP2 predicted protein also possesses several 

predicted transmembrane domains. One entry to the best 
for ARGP2 has also been allocated an STS (sequence tagged 



10 



15 , 
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site) at the Whitehead Institute, (entry # WI-H660) and 
has thus been mapped to human chromosome 12. 

Sterol ester if ication enzvmes evolve as gene families in 
5 multiple organisms. Using the hACAT and AGRP nucleotide 

sequences as probes of multiple databases, we sought to 
establish whether the observation of gene families of 
ACAT related enzymes in yeast and humans was a common 
occurrence in other organisms. In general this is the 

10 case (Fig. 13). Sequences from the genome of C. eleoans. 

EL — melanogastor and S . pombe . have been identified that 
are distinct from each other, within an organism, and 
exhibit approximately 25% identity at the predicted 
protein level. As for all the ACAT- like proteins, the 

15 maximum conservation is observed at the COOH- terminal 

region, with many of the apparently critical motifs 
described below, being maintained. As would be 

anticipated the mouse cDNA for ARGP1 exhibits 
approximately 85% identity with its human homolog. 

20 

Sequence conservation between ARGPs and ACAT in multiple 
organisms . As described above, these sequences are 
ubiquitous. This conservation, across and within 

organisms, facilitates the identification of critical 

25 domains of esterif ication enzymes (Fig. 14). 

Interestingly, there is no sequence similarity between 
any ACAT- like molecule and lecithin cholesterol 
acyltransferase (LCAT) , despite the shared utilization of 
cholesterol. For the hACAT sequence and its murine 

30 homologs, a similarity to "signature' motifs of enzymes 

involved in acyl adenylation reactions was reported (8, 
12) . However, these sequences are unlikely to be 
critical, since they are not conserved in any homolog 
from any other organism. By contrast, there are regions 

35 of strong conservation between these molecules which may 

be critical to function. In the esterif ication 

defective, SRD4 mutant CHO cell line, the expressed but 
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defective ACAT allele encodes a single amino-acid 
substitution of leucine 265 lies in a conserved domain of 
human, rodent and yeast ACAT. Interestingly, this motif 
in ARGPl is more degenerate, although the serine is 
5 conserved, the flanking sequence is conservatively 

replaced by similar residues. The ACAT reaction is 
probably mediated by a multimeric complex, as shown by 
radiation inactivation experiments (15) . Accordingly, 
the yeast and human sequences all possess "leucine zipper" 

10 multimerization motifs. ARGPl and ARGP2 lack a classical 
multimerization motif. Although protein phosphorylation 
as a mode of ACAT regulation has been refuted (16) , a 
very strong region of conservation (consensus over 7 
sequences; LN E FGDR-FYGDWWN, single letter amino- 

15 acid code) predicts a tyrosine kinase consensus motif for 

phosphorylation. ARGP2 and ARGPl are no exception to 
this. In particular the aspartic acid- tryptophan - 
tryptophan- asparagine (DWWN) sequence appears to be 
invariant (with the exception of S.pombe. where it is 

2 0 AWWN) and may represent an active site for the 

esterif ication reaction. These regions if conservation 
are targets for mutagenesis and in preliminary 
experiments appear critical to the activity of the ACAT 
and ARE enzymes (17) . 

25 

Why ACAT gene families? The role, if any, of these ACAT 
sequence homologs in sterol homeostasis is unclear. 
Since mouse macrophage ACAT is not critical to sterol 
esterif ication in the liver and intestine, it is possible 

30 that the additional enzymes evolved to recognize 

alternate substrates and thus promote sterol absorption 
in the intestine or production of lipoproteins by the 
liver (39) . Future experiments will be directed to 
complete the molecular characterization of these genes 

35 and test these hypotheses. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(I) APPLICANT: Stephen L . Sturley 

(ii) TITLE OF INVENTION: DNA ENCODING ACYLCOENZYME A: CHOLESTEROL 

ACYLTRANS FERASE 11 AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: 

(B) STREET: 1185 Avenue of the Americas 
<£> CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036 

(v> COMPUTER READABLE FORM: 

(A) * MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

© OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Not Yet Known 

(B) FILING DATE: Herewith 
© CLASSIFICATION: 

<viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: John P. White 

(B) REGISTRATION NUMBER: 28,678 
REFERENCE /DOCKET NUMBER: 0575/50852-A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 278-0400 

(B) TELEFAX: (212) 391-0525 
€> TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 364 9 base pairs 

(B) TYPE: nucleic acid 
© STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GGGTAGAGAC GGGGTTTCAC CGTGTTAGCC AGGATGGTCT GGATCTCCTG ACCTCGTGAT 60 
CCACCCACCT CGGCCTCCTA AAGT GCTGGG ATTACAGACA TGAGCCACCG CGCCCAGCCC 120 
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T AT TC AT C C C TTTTCAAAAG TCAGACCCTA GGAAGCTGGA GGGAGGTGGG GCATGGTTTT 18 0 

ACAGTGAATT TCTGATTTCA CTCAGGGTGA TAAATCAGAC TCTTGGGGAA GCGGGTGGTG 24 0 

GCTCTGGACA GCAGCAGGAA TGGGGATCCA GTTAGCAACA AATCCATGGA CCTATGACAG 300 

GCTGAAAGCC ACCCCTTCTC CATCTTTGGG AGGTTGCCAA TGTCTGATTT AAC AC TAT CC 360 

AATGAATGAT CATTGAAAGT AAAAAATAAC TATCAACTAG CAGAAAATAT AAATGGTAAG 420 

CATTAGCACA TATTTCACAT GTTTATATTT GGCTCTCAGA TTGACCTATA AAACAAAGTC 4 80 

TGGGAAATTC T ATAT GAT CC TGAAAAAATG ATACGCTGGT CTGGATGGTA GAATAAGTTG 54 0 

GAGAAATGTT T AAGC CAAAA TGCAGTCTTA CCAATGACTT TTTATTTTAT TTTATTAATT 600 

TTCAGGATTT TTGGTATACA GGTGGTTTTT GGTTACATGG AAAAGTTCTT T ACT GGT GAT 660 

TTCTGAGATT TTAGTTCACC CCTTATCCTG AGCAGTGTAC ACTGTTCCCA ATATGTAGCC 720 

TTTTATCCCT CACCCCCTCT AAGTTCAAGA AGACTAT GGT CCTGCAGAAA GCTTT AT AT G 7 80 

TAATTAACAT ATCTTTATCT TTATCTTTAT AGGCAGTAGA CTCATCTTTT GAAACAGATT 840 

CCATTAAGAG TGAATGTGTA CCCTCCCTCT AGCCTTTATT ATTACTGTTT TTGCTATTAC 900 

ATGTGTTAGT GTATGTGAAT TTAATGCTTA AAAATGTATC CCATTGGCTA CT AT GGCAAA 960 

AGGTTGACTC ATAAGAGTTT AGCACGGGTT AAGATCTGAA AGTTTTCTCC CAGC CTCTTA 102 0 

TCACTGGCGC AGACTTCACA ATT CAT GGAA GCCACCAGTG AGATGACATT GCCTCAGGCA 108 0 

GTTACTATTT T TAT ATT CT A TAACTCGAGG AGCTCAGGGT TTCGGAAATC ATTAAAC TTT 114 0 

TTTTGTCCTT TTAAAGTTGG AGACAGCAAT TGTAGACAGC CTTCCAGTGG GTTATCTTTT 1200 

TGTGTCTCCT TACCTGTGGA GAAGCCTATT AGCTGGGATA TGTAGTTAAA TAGCTATATT 1260 

TATATATATC CAGGGCACCC CGAATTCGGG AGAGCTTCCC GGAGT CGAC C TTCCTGCTGG 1320 

CTGCTCTGTG ACCGCTTCCC GGCTCTGCCC TCTTGGCCGA AGTGCCCGCT GCCGGGCGCG 1380 

GGCCTCAGAC AATACAATGG TGGGTGAAGA GAAGATGTCT CTAAGAAACC GGCTGTCAAA 144 0 

GTCCAGGGAA AATCCTGAGG AAGAT GAAGA CCAGAGAAAC CCTGCAAAGG AGTCCCTAGA 1500 

GACACCTAGT AATGGTCGAA TTGACATAAA ACAGTTGATA GCAAAGAAGA TAAAGTTGAC 1560 

AGCAGAGGCA GAGGAATTGA AGCCATTTTT TAT GAAGGAA GTTGGCAGTC ACTTTGATGA 1620 

TTTTGTGACC AATCTCATTG AAAAGT CAGC ATCATTAGAT AATGGTGGGT GCGCTCTCAC 168 0 

AACCTTTTCT GTTCTTGAAG GAGAGAAAAA CAACCATAGA GCGAAGGATT TGAGAGCACC 174 0 

TCCAGAACAA GGAAAGATTT TTATTGCAAG GCGCTCTCTC TTAGATGAAC TGCTTGAAGT 1800 

GGACCACATC AGAACAATAT AT CACAT GTT TATTGCCCTC CTCATT CTCT TTATCCTCAG 1860 

CACACTTGTA GTAGATTACA TTGATGAAGG AAGGCTGGTG CTTGAGTTCA GCCTCCTGTC 1920 

TTATGCTTTT GGCAAATTTC CTACCGTTGT TTGGACCTGG TGGATCATGT TCCTGTCTAC 1980 

ATTTTCAGTT CCCTATTTTC TGTTTCAACA TTGGCGCACT GGCTATAGCA AGAGTTCTCA 2040 
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TCCGCTGATC 


CGTTCTCTPT 


TPPATGGPTT 

X v^«WVX UUVm X 1 


TfTTTTP TV IT 
1^1111 A 1 G 


AT CTT C CAGA 


TTGGAGTTCT 


2100 


AGGTTTTGGA 


C C AAC AT AT G 

X^4T^#^^^4^^ 4V £ VJ 


TTGTGTT AGP 

X X \J X O X X rtw 


rtlMl AVLHL 1 G 


LLACCAGCTT 


CCCGGTTCAT 


2160 


CATT AT ATT C 


GAGCAGATT C 


GTTTTGTAAT 

X X X X VJ X ^V"V X 




rp yv rp rp rp / «f v i n tv 

1 L-ATTTGTCA 


GAGAGAACGT 


2220 


GCCTCGGGTA 


CT AAAT T CA G 


CTAAGGAGAA 


>\ 1 LAHljLAv, 1 


G I TCCAATAC 


CTACAGTCAA 


2280 


CCAGT AT T T G 


TACTTPTTAT 

X w X X ^. X X /A X 


TTGPTPPTHP 

X X VJ^> X X A\rf 


P P T m B T» P T 7A P 
V/^l 1A1L1 ML 


rp /"^ TV TV r+r+rw\ 

u GT GACAGCT 


ATCCCAGGAA 


2340 


TCC CACT GT A 


A G2\ TGGfZGTT 


Ml Gl LuL 1 A 1 


GAAGT T T GC A 


CAGGTCTTTG 


GTTGCTTTTT 


2400 


TTATfiTHTflr 

^ l/Al V3 X VJ X Mv> 


X -rA\—/-Y X X X X \3 


MHMbuL-1 lib 


iGCCCCCTTG 


TTTCGGAATA 


TCAAACAGGA 


2460 


gp pp ?t p a gp 


VjV-. X L Lr x 1 ^ 


x GG I 1 AU G 


T GGT AT T T AA 


CTCCATCTTG 


CCAGGTGTGC 


2520 


T G IVTT PT fTT 


Lv^i J M.V~ 1111 


111 bLL 1111 


TGCACTGCTG 


GCTCAATGCC 


TTT GCTGAGA 


2580 


1 Ul l>\0*jl^± 1 


1 GG 1 GML.AGG 


T\ Ti / 'film o nr» tv m t\ 

A 1 GTT C TATA 


AGGATTGGTG 


GAACTCCACG 


TCATACTCCA 


2640 




AALL 1 GGAA I 


GTGGTGGTCC 


AT GAC T GGCT 


ATATTACTAT 


GCTTACAAGG 


2700 


TV ri i rp rp /^fp i*"*rp 


GTTTTTCTCC 


AAGAGATT CA 


AATCTGCTGC 


CAT GTTAGCT 


GTCTTTGCTG 


2760 


IniLl GL. 1 G 1 


tv ot*7\ TV TV 

AG 1 ACAC GAA 


TATGCCTTGG 


CTGTTTGCTT 


GAGCTTTTTC 


TATCCCGTGC 


2820 


Ibi 1 OG 1 GL 1 


C TT CAT GTT C 


T T T GGAAT GG 


CTTTCAACTT 


CAT TGT CAAT 


GATAGT C GGA 


2880 


rUVViwLV,. GM 1 


1 1 GGAATGTT 


C T GAT GT GGA 


CTTCTCTTTT 


CTTGGGCAAT 


GGAGT CTT AC 


2940 


1 L- 1 1 1 I I A 


rprp /~»rp z-' Tv t\ /— 7\ tv 

I 1 CTCAAGAA 


T GGT AT GCAC 


GTCGGCACTG 


TCCTCTGAAA 


AATCCCACAT 


3000 


X 1 X 1 yj\jJ-K. 1 1M 


ivjl v-L.GGL.GA 


C GTT CCTGGA 


CTTGT C GTTA 


C GT GTTTT AG 


AAGCTTGGAC 


3060 


X X X V7 1 i i \**\* 1 


G G 1 1 G 1 GAG 1 


GAAGAT T G GG 


TAGCTCCCTG 


ATTTGGAGCC 


AGCT GTTTCC 


3120 


AGTTGTTTaPT 
MO X X V7 X X MV« X 


flTA TA /T^T 1 T 1 TA *P P T 1 


blbl 1A1 iTG 


GAC C AC T C C A 


GGCTTTACAG 


AT GAC TC ACT 


3180 


C P ATT p pt a fZ 




LAmAC T GT 


TGGAAGTTCA 


CTGGAGTCTT 


GTACACTTAA 


3240 




1111111b 


1 buabu 1 GGG 


TGGGGGGAGA 


AGACC GACTA 


ACAGCTGAAG 


3300 


T a at an p a r;n 


1 1<j1 1 1 


G 1 W\ I A 1 LAb 


rp rp rT» tv rp #tfrt 

CTTTATCCCT 


T GGT AATT AT 


ATCTGTTTTG 


3360 


TTTPTT f^TA rT 
X X X V* X X UMLi X 


p T f^T rr ta n t n 

v-iul LL«M 1 


MGAGAA 1 AAA 


CAT C ATAGTT 


TCTTGGCCAC 


TGAATTAGCC 


3420 


AAAACACTTA 


GGAAGAAATC 


ACTTAAATAC 


CTCTGGCTTA 


GAAATTTTTT 


CATGCACACT 


3480 


GTT GGAATGT 


ATGCTAATTG 


AAC AT GCAAT 


TGGGGAAGAA 


AAAATGTAGA 


AT GATTTTT G 


3540 


CTATTTCTAG 


TAGAAAGAAA 


ATGTCTGTTT 


TCCAAAGATA 


AT GTT AT ACA 


TCCTATTTTG 


3600 


TAATTTTTTT 


GAAAAAAGTT 


CAATGTTCAG 


TTTTCCTTAGT TTTTACCTT 




3660 


(2) INFORMATION FOR SEQ ID NO: 2: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Val Gly Glu Glu Lys Met Ser Leu Arg Asn Arg Leu Ser Lys Ser 
1 5 . 10 is 

Arg Glu Asn Pro Glu Glu Asp Glu Asp Gin Arg Asn Pro Ala Lys Glu 
20 25 30 

Ser Leu Glu Thr Pro Ser Asn Gly Arg He Asp He Lys Gin Leu lie 
35 40 45 

Ala Lys Lys He Lys Leu Thr Ala Glu Ala Glu Glu Leu Lys Pro Phe 
50 55 60 

Phe Met Lys Glu Val Gly Ser His Phe Asp Asp Phe Val Thr Asn Leu 
65 70 75 80 

He Glu Lys Ser Ala Ser Leu Asp Asn Gly Gly Cys Ala Leu Thr Thr 
85 90 95 

Phe Ser Val Leu Glu Gly Glu Lys Asn Asn His Arg Ala Lys Asp Leu 
100 105 110 

Arg Ala Pro Pro Glu Gin Gly Lys He Phe He Ala Arg Arg Ser Leu 
115 120 125 

Leu Asp Glu Leu Leu Glu Val Asp His He Arg Thr He Tyr His Met 
130 135 140 

Phe He Ala Leu Leu He Leu Phe He Leu Ser Thr Leu Val Val Asp 
145 150 155 160 

Tyr He Asp Glu Gly Arg Leu Val Leu Glu Phe Ser Leu Leu Ser Tyr 
165 170 175 

Ala Phe Gly Lys Phe Pro Thr Val Val Trp Thr Trp Trp He Met Phe 
180 185 190 

Leu Ser Thr Phe Ser Val Pro Tyr Phe Leu Phe Gin His Trp Arg Thr 
195 200 205 

Gly Tyr Ser Lys Ser Ser His Pro Leu He Arg Ser Leu Phe His Gly 
210 215 220 

Phe Leu Phe Met He Phe Gin He Gly Val Leu Gly Phe Gly Pro Thr 
225 230 235 240 

Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser Arg Phe lie lie 
245 250 255 

lie Phe Glu Gin lie Arg Phe Val Met Lys Ala His Ser Phe Val Arg 
260 265 270 

Glu Asn Val Pro Arg Val Leu Asn Ser Ala Lys Glu Lys Ser Ser Thr 
275 280 285 

Val Pro lie Pro Thr Val Asn Gin Tyr Leu Tyr Phe Leu Phe Ala Pro 
290 295 300 
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Thr Leu He Tyr Arg Asp Ser Tyr Pro Arg Asn Pro Thr Val Arq TrD 
305 310 315 ^ 320 

Gly Tyr Val Ala Met Lys Phe Ala Gin Val Phe Gly Cys Phe Phe Tyr 
325 330 335 

Val Tyr Tyr He Phe Glu Arg Leu Cys Ala Pro Leu Phe Arg Asn He 
340 345 350 

Lys Gin Glu Pro Phe Ser Ala Arg Val Leu Val Leu Cys Val Phe Asn 
355 360 365 

Ser He Leu Pro Gly Val Leu He Leu Phe Leu Thr Phe Phe Ala Phe 
370 375 380 

Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu Arg Phe Gly Asd 
385 390 395 400 

Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser Tyr Ser Asn Tyr 
405 410 415 

Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu Tyr Tyr Tyr Ala 
420 425 430 

Tyr Lys Asp Phe Leu Trp Phe Phe Ser Lys Arg Phe Lys Ser Ala Ala 
435 440 445 

Met Leu Ala Val Phe Ala Val Ser Ala Val Val His Glu Tyr Ala Leu 
450 455 460 

Ala Val Cys Leu Ser Phe Phe Tyr Pro Val Leu Phe Val Leu Phe Met 
4 " 470 475 480 

Phe Phe Gly Met Ala Phe Asn Phe He Val Asn Asp Ser Arg Lys Lys 
485 490 495 

Pro He Trp Asn Val Leu Met Trp Thr Ser Leu Phe Leu Gly Asn Gly 
500 505 510 

Val Leu Leu Cys Phe Tyr Ser Gin Glu Trp Tyr Ala Arg Arg His Cys 
515 520 525 

Pro Leu Lys Asn Pro Thr Phe Leu Asp Tyr Val Arg Pro Arg Ser Trp 
530 535 540 

Thr Cys Arg Tyr Val Phe 
545 550 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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GCCCTCCAGC TCTCTACTAA GACCGGTCGC AAGCATGCTG GGC GATAT AT CCAAACCACA 60 
CCACACATGG TCTCCCTCCT GCGT CAAAAT CTCCCCAGAC AGTCCGGACC CGCACCCGAT 12 0 

ATCCAGAATG AAACTGCACG GCTGCAGATT CAAAAGCTCC AACGCCCTCA GCGTCATCTT 180 
CGCCTGGATA TGCT GCACTC TGGTCGAACC CGTGTACTTG TGTGCTTCGC TATCATTATA 240 
GAAAATCTCC GGTGGTGCCA ACTCCTCAGG AC GT GACATT ATTTCTTCTC T GATAT ATTT 300 
CCTGTGTTTC CGTACCGCAC CTTTTTAGCA CTACTTTTTT ACTATGCTCT TCTTCTTGTG 360 
CTTCTTCTGC TTTTTTCCTC TT TATCAC AC TAT GTATGT G CTGCTCATCT CTTCTTTTTA 420 
TCGATAAAAT TGAAAAATGT GAGAT GGTGT AGAGTGAAAA AAAAAAAAAA ATCTGGCTTG 480 
GCCATCAAAT ACCCGGCCGT GGTTGGACTC GTTTAGCGAA CAATAGCACC CAGCAGACCC 54 0 

TGGCAACATG C GGAT GATAT AAGAAGGACG AGCGTGGTGG AGGAAAGGGG CGCCATTGGC 600 

ACACTCACGC AGGTGGTTGT T C AG CAC GGC TT GCAGCAAG AGC GCCAAAA CAGATT GCAA 660 

GAAT GAC GGA GACTAAGGAT TTGTTGCAAG AC GAAGAGTT TCTTAAGATC CGCAGACT C A 72 0 

ATTCCGCAGA AGCCAACAAA CGGCATTCGG T CAC GT AC GA T AAC GT GAT C CTGCCACAGG 7 80 

AGTCCATGGA GGTTTCGCCA CGGTCGTCTA CCACGTCGCT GGTGGAGCCA GTGGAGTCGA 8 40 

CTGAAGGAGT GGAGTCGACT GAGGCGGAAC GTGTGGCAGG GAAGCAGGAG CAGGAGGAGG 900 

AGTACCCTGT GGACGCCCAC AT G C AAAAGT ACCTTTCACA CCTGAAGAGC AAGTCTCGGT 960 

CGAGGTTCCA CCGAAAGGAT GCTAGCAAGT ATGT GTCGTT TTTTGGGGAC GTGAGTTTTG 1020 

ATCCTCGCCC CACGCTCCTG GACAGCGC C A TCAACGTGCC C TT CCAGACG ACTTTCAAAG 108 0 

GTCCGGTGCT GGAGAAACAG CTCAAAAATT TACAGTTGAC AAAGACCAAG ACCAAGGCCA 1140 

CGGTGAAGAC T AC GGTGAAG AC T AC GGAGA AAACGGACAA GGCAGATGCC CCCCCAGGAG 12 00 

AAAAACTGGA GTCGAACTTT TCAGGGATCT ACGTGTTCGC ATGGATGTTC TTGGGCTGGA 12 60 

T AGC GATCAG GTGCTGCACA GATTACTATG CGTCGTACGG CAGTGCATGG AATAAGCTGG 1320 

AAATCGTGCA GTACATGACA ACGGACTTGT TCACGATCGC AATGTTGGAC TTGGCAATGT 1380 

TCCTGTGCAC TTTCTTCGTG GTTTTCGTGC ACTGGCTGGT GAAAAAGCGG ATCATCAACT 14 40 

GGAAGT GGAC TGGGTTCGTT GCAGTGAGCA TCTTCGAGTT GGCTTTCATC CCCGTGACGT 1500 

TCCCCATTTA CGTCTACTAC TTT GATTTC A ACTGGGTCAC GAGAATCTTC CTGTTCCTGC 1560 

ACTCCGTGGT GTTTGTTATG AAGAGCCACT CGTTTGCCTT TTACAACGGG TATCTTTGGG 162 0 

ACATAAAGCA GGAACTCGAG TACTCTTCCA AACAGTT GCA AAAATACAAG GAATCTTTGT 1680 

CCCCAGAGAC CCGCGAGATT CTGCAAAAAA GTTGCGACTT TTGCCTTTTC GAAT TGAACT 17 4 0 

ACCAGACCAA GGAT AAC GAC TTCCCCAACA ACATCAGTTG CAGCAATTTC TTCATGTTCT 1800 

GTTTGTTCCC CGTCCTCGTG T AC C AGATC A ACTACCCAAG AACGTCGCGC ATCAGATGGA 18 60 

GGTATGTGTT GGAGAAGGTG TGCGCCATCA TTGGCACCAT CTTCCTCATG ATGGTCACGG 1920 
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CACAGTTCTT CATGCACCCG GTGGCCATGC GCTGTATCCA GTTCCACAAC ACGCCCACCT 198 0 

TCGGCGGCTG GATCCCCGCC ACGCAAGAGT GGTTCCACCT GCTCTTCGAC ATGATTCCGG 204 0 

GCTTCACTGT TCTGTACATG CTCACGTTTT AC AT GAT AT G GGACGCTTTA TTGAATTGCG 2100 

TGGCGGAGTT GACCAGGTTT GCGGACAGAT ATTTCTACGG CGACTGGTGG AATTGCGTTT 2160 

CGTTTGAAGA GTTTAGCAGA ATCTGGAACG TCCCCGTTCA CAAATTTTTA CTAAGACACG 222 0 

TGTACCACAG CTCCATGGGC GCATTGCATT T GAG C AAGAG CCAAGCTACA TTATTTACTT 22 8 0 

TTTTCTTGAG TGCCGTGTTC CACGAAATGG CCATGTTCGC CATTTTCAGA AGGGTTAGAG 2 34 0 

GATATCTGTT CATGTTCCAA CTGTCGCAGT TTGTGTGGAC TGCTTTGAGC AACACCAAGT 24 00 

TTCTACCGGC AAGACCGCAG TTGTCCAACG TTGTCTTTTC GTTTGGTGTC TGTTCAGGGC 2 4 60 

C C AGT AT CAT TATGACGTTG TACCTGACCT TATGAACTGC CACCATACCA CGTGTGTCCC 252 0 

TCGCAAGCCC TT GATAGAT A TACAATAGGG AATGGGCGTC CGTCCACCGT GGTCAAAGAC 25 8 0 
AGGGGCAAAG AGCTCCTAGG T 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Thr Glu Thr Lys Asp Leu Leu Gin Asp Glu Glu Phe Leu Lys lie 
15 10 15 

Arg Arg Leu Asn Ser Ala Glu Ala Asn Lys Arg His Ser Val Thr Tyr 
20 25 30 

Asp Asn Val lie Leu Pro Gin Glu Ser Met Glu Val Ser Pro Arg Ser 
35 40 45 

Ser Thr Thr Ser Leu Val Glu Pro Val Glu Ser Thr Glu Gly Val Glu 
50 55 60 

Ser Thr Glu Ala Glu Arg Val Ala Gly Lys Gin Glu Gin Glu Glu Glu 
65 70 75 80 

Tyr Pro Val Asp Ala His Met Gin Lys Tyr Leu Ser His Leu Lys Ser 
85 90 95 

Lys Ser Arg Ser Arg Phe His Arg Lys Asp Ala Ser Lys Tyr Val Ser 
100 105 110 

Phe Phe Gly Asp Val Ser Phe Asp Pro Arg Pro Thr Leu Leu Asp Ser 
115 120 125 
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Ala lie Asn Val Pro Phe Gin Thr Thr Phe Lys Gly Pro Val Leu Glu 
130 135 140 

Lys Gin Leu Lys Asn Leu Gin Leu Thr Lys Thr Lys Thr Lys Ala Thr 
145 150 155 160 

Val Lys Thr Thr Val Lys Thr Thr Glu Lys Thr Asp Lys Ala Asp Ala 
165 170 175 

Pro Pro Gly Glu Lys Leu Glu Ser Asn Phe Ser Gly lie Tyr Val Phe 
180 185 190 

Ala Trp Met Phe Leu Gly Trp lie Ala He Arg Cys Cys Thr Asp Tyr 
195 200 205 

Tyr Ala Ser Tyr Gly Ser Ala Trp Asn Lys Leu Glu He Val Gin Tyr 
210 215 220 

Met Thr Thr Asp Leu Phe Thr He Ala Met Leu Asp Leu Ala Met Phe 
225 230 235 240 

Leu Cys Thr Phe Phe Val Val Phe Val His Trp Leu Val Lys Lys Arg 
245 250 255 

lie lie Asn Trp Lys Trp Thr Gly Phe Val Ala Val Ser He Phe Glu 
260 265 270 

Leu Ala Phe He Pro Val Thr Phe Pro He Tyr Val Tyr Tyr Phe Asp 
275 280 285 

Phe Asn Trp Val Thr Arg He Phe Leu Phe Leu His Ser Val Val Phe 
290 295 300 

Val Met Lys Ser His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Asp 
305 310 315 320 

He Lys Gin Glu Leu Glu Tyr Ser Ser Lys Gin Leu Gin Lys Tyr Lys 
325 330 335 

Glu Ser Leu Ser Pro Glu Thr Arg Glu He Leu Gin Lys Ser Cys Asp 
340 345 350 

Phe Cys Leu Phe Glu Leu Asn Tyr Gin Thr Lys Asp Asn Asp Phe Pro 
355 360 365 

Asn Asn He Ser Cys Ser Asn Phe Phe Met Phe Cys Leu Phe Pro Val 
370 375 380 

Leu Val Tyr Gin He Asn Tyr Pro Arg Thr Ser Arg He Arg Trp Arg 
385 390 395 400 

Tyr Val Leu Glu Lys Val Cys Ala He He Gly Thr He Phe Leu Met 
405 410 415 

Met Val Thr Ala Gin Phe Phe Met His Pro Val Ala Met Arg Cys He 
420 425 430 

Gin Phe His Asn Thr Pro Thr Phe Gly Gly Trp He Pro Ala Thr Gin 
435 440 445 

Glu Trp Phe His Leu Leu Phe Asp Met He Pro Gly Phe Thr Val Leu 
450 455 460 
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Tyr Met Leu Thr Phe Tyr Met lie Trp Asp Ala Leu Leu Asn Cys Val 
465 470 475 480 

Ala Glu Leu Thr Arg Phe Ala Asp Arg Tyr Phe Tyr Gly Asp Trp Trp 
485 490 495 

Asn Cys Val Ser Phe Glu Glu Phe Ser Arg lie Trp Asn Val Pro Val 
500 505 510 

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Gly Ala Leu 
515 520 525 

His Leu Ser Lys Ser Gin Ala Thr Leu Phe Thr Phe Phe Leu Ser Ala 
530 535 540 

Val Phe His Glu Met Ala Met Phe Ala lie Phe Arg Arg Val Arg Gly 
545 550 555 560 

Tyr Leu Phe Met Phe Gin Leu Ser Gin Phe Val Trp Thr Ala Leu Ser 
565 570 575 

Asn Thr Lys Phe Leu Arg Ala Arg Pro Gin Leu Ser Asn Val Val Phe 
580 585 590 

Ser Phe Gly Val Cys Ser Gly Pro Ser lie lie Met Thr Leu Tyr Leu 
595 600 605 

Thr Leu 
610 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



TATAAAATTC 


CTTTCATCAA 


TACATCTATA 


TATT C GAAT A 


TATAGATAAA 


CCAATACAAA 


60 


AACATACTGA 


AATTTTTTGA 


AAACAACTAA 


AACTATTCAT 


TGCAGTTACA 


CGTGAATGCT 


120 


AAACTTTATA 


TCGCTCTTGT 


CGGTCCCGCG 


GAGTTAACAT 


TTAACGGCTT 


CTCGCGCAAT 


180 


AACCGGAAAA 


ATTCCAACAG 


TTTCTTTGTA 


ATATTATTAA 


GCCTTCTTTT 


TTCCCGGAAT 


240 


CTATAAGAGG 


GGAC GAAAAT 


TAGCCGCTAT 


TAATTCTGGT 


ATTGCCACCT 


AGACAAGAAG 


300 


TAAACAGACA 


CATTACGTTA 


GCAAAAGCAA 


CAATAACAAA 


CACAACCATG 


GACAAGAAGA 


360 


AGGAT CT ACT 


GGAGAACGAA 


CAATTTCTCC 


GCATCCAAAA 


GCTCAACGCT 


GCCGATGCGG 


420 


GCAAAAGACA 


AT CT ATAACA 


GTGGACGACG 


AGGGCGAACT 


AT AT GGGTTA 


GACACCTCCG 


480 


GCAACTCACC 


AGCCAATGAA 


CACACAGCTA 


CCACAATTAC 


ACAGAATCAC 


AGCGTGGTGG 


540 
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CCTCAAACGG AGACGTCGCA TTCATCCCAG GAACT GCTAC CGAAGGCAAT ACAGAGATTG 600 

TAACTGAAGA AGTGATTGAG ACC GAT GAT A ACATGTTCAA GACCCATGTG AAGACTTTAA 660 

GCTCCAAAGA GAAGGC AC GG TATAGGCAAG GGTCCTCTAA CTTTATATCG TATTTCGATG 720 

AT AT GT CAT T TGAACACAGG C CCAGTATAT TAGATGGGTC AGTTAAC GAG CCCTTCAAGA 7 80 

CCAAATTCGT GGGACCTACT TTAGAAAAGG AGATCAGAAG AGGGGAGAAA GAGCTAATGG 8 40 

CCATGCGCAA AAATTTACAC CAC CGCAAGT CCTCCCCAGA TGCTGTCGAC TCAGTAGGGA 900 

AAAATGATGG CGCCGCCCCA ACTACTGTTC CAACTGCCGC CACCTCAGAA ACGGTGGTCA 960 

CCGTTGAAAC CACCATAATT T CAT C C AATT TCTCCGGGTT GTACGTGGCG TTTT GGATGG 1020 

CTATTGCATT TGGTGCTGTC AAGGCTTTAA TAGACTATTA TTACCAGCAT AATGGTAGCT 1080 

TCAAGGATTC GGAGATCTTG AAATTTATGA CTACGAATTT GTTCACTGTG GCATCCGTAG 114 0 

ATCTTTTGAT GTATTTGAGC ACTTATTTTG TCGTTGGAAT ACAATACTTA TGCAAGTGGG 1200 

GGGTCTTGAA ATGGGGCACT ACCGGCTGGA TCTTCACCTC AATTT AC GAG TTTTTGTTTG 12 60 

TTATCTTCTA CATGTATTTA ACAGAAAACA TCCTAAAACT ACACT GGCT G TCCAAGATCT 1320 

TCCTTTTTTT GCATTCTTTA GTTTTATTGA T GAAAATGCA TTCTTTCGCC TTCTACAATG 1380 

GCTATCTATG GGGTATAAAG GAAGAAC T AC AATTTTCCAA AAGCGCTCTT GCCAAATACA 14 40 

AGGATTCTAT AAATGATCCA AAAGTTATTG GTGCTCTTGA GAAAAGCT GT GAGTTTTGTA 1500 

GTTTTGAATT GAGCTCTCAG TCTTTAAGCG ACCAAACTCA AAAATTCCCC AACAATATCA 1560 

GT GCAAAAAG CTTTTTTTGG TTCACCATGT TTCCAACCCT AATTTAC CAA ATTGAATATC 1620 

CAAGAACTAA GGAAAT CAGA TGGAGCTACG TAT T AGAAAA GATCTGCGCC ATCTTCGGTA 1680 

CCATTTTCTT AAT GAT GAT A GATGCTCAAA TCTTGATGTA TCCTGTAGCA AT GAGAGC AT 17 40 

TGGCTGTGCG CAATTCTGAA TGGACTGGTA TATTGGATAG ATTATT GAAA TGGGTTGGAT 18 00 

TGCTCGTTGA TATCGTCCCA GGGTTTATCG T GAT GT AC AT CTTGGACTTC TATTTGATTT 18 60 

GGGATGCCAT TTTGAACTGT GTGGCTGAAT TGACAAGATT TGGCGACAGA TATTTCTACG 1920 

GTGACTGGTG GAATTGTGTT AGTTGGGCAG ACTTCAGTAG AATTT GGAAC ATCCCAGTGC 1980 

ATAAGTTTTT GTTAAGACAT GTTTACCATA GT TCAATGAG TTCATTCAAA TTGAACAAGA 2040 

GTCAAGCAAC TTTGATGACC TTTTT CTTAA GTTCCGTCGT TCATGAATTA GCAATGTACG 2100 

TTATCTTCAA GAAATT GAGG TTTTACTTGT TCTTCTTCCA AATGCTGCAA AT GC CATT AG 2160 

TAGCTTTAAC AAATACTAAA TTCAT GAGGA ACAGAACCAT AAT C GGAAAT GTTATTTTCT 2220 

GGCTCGGTAT CTGCATGGGA CCAAGTGTCA TGTGTACGTT GTACTTGACA TTCTAAGGCA 22 80 

TCCTGCAACT GTTCTGTGGA GCTATTAAAT CTTTATAGTA AATTTTTTTT TACTTTTTTT 2340 

TTTTTTTTTT TTTTTTTTTA TTATTTACAA GCGTCTATAT ATTTTCTATT ATAGAATATT 24 00 
GTCATTTATT ACATT GGTTC A 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Amino Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asp Lys Lys Lys Asp Leu Leu Glu Asn Glu Gin Phe Leu Arg lie 
15 10 15 

Gin Lys Leu Asn Ala Ala Asp Ala Gly Lys Arg Gin Ser lie Thr Val 
20 25 30 

Asp Asp Glu Gly Glu Leu Tyr Gly Leu Asp Thr Ser Gly Asn Ser Pro 
35 40 45 

Ala Asn Glu His Thr Ala Thr Thr lie Thr Gin Asn His Ser Val Val 
50 55 60 

Ala Ser Asn Gly Asp Val Ala Phe lie Pro Gly Thr Ala Thr Glu Gly 
65 70 75 80 

Asn Thr Glu lie Val Thr Glu Glu Val lie Glu Thr Asp Asp Asn Met 

85 90 95 

Phe Lys Thr His Val Lys Thr Leu Ser Ser Lys Glu Lys Ala Arg Tyr 
100 105 110 

Arg Gin Gly Ser Ser Asn Phe lie Ser Tyr Phe Asp Asp Met Ser Phe 
115 120 125 

Glu His Arg Pro Ser lie Leu Asp Gly Ser Val Asn Glu Pro Phe Lys 
130 135 140 

Thr Lys Phe Val Gly Pro Thr Leu Glu Lys Glu lie Arg Arg Arg Glu 
145 150 155 160 

Lys Glu Leu Met Ala Met Arg Lys Asn Leu His His Arg Lys Ser Ser 
165 170 175 

Pro Asp Ala Val Asp Ser Val Gly Lys Asn Asp Gly Ala Ala Pro Thr 
180 185 190 

Thr Val Pro Thr Ala Ala Thr Ser Glu Thr Val Val Thr Val Glu Thr 
195 200 205 

Thr lie lie Ser Ser Asn Phe Ser Gly Leu Tyr Val Ala Phe Trp Met 
210 215 220 

Ala He Ala Phe Gly Ala Val Lys Ala Leu He Asp Tyr Tyr Tyr Gin 
225 230 235 240 

His Asn Gly Ser Phe Lys Asp Ser Glu He Leu Lys Phe Met Thr Thr 
245 250 255 
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Asn Leu Phe Thr Val Ala Ser Val Asp Leu Leu Met Tyr Leu Ser Thr 
260 265 270 

Tyr Phe Val Val Gly lie Gin Tyr Leu Cys Lys Trp Gly Val Leu Lys 
275 280 285 

Trp Gly Thr Thr Gly Trp He Phe Thr Ser He Tyr Glu Phe Leu Phe 
290 295 300 

Val lie Phe Tyr Met Tyr Leu Thr Glu Asn He Leu Lys Leu His Trp 
305 310 315 320 

Leu Ser Lys He Phe Leu Phe Leu His Ser Leu Val Leu Leu Met Lys 
325 330 335 

Met His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Gly He Lys Glu 
340 345 350 

Glu Leu Gin Phe Ser Lys Ser Ala Leu Ala Lys Tyr Lys Asp Ser lie 
355 360 365 

Asn Asp Pro Lys Val He Gly Ala Leu Glu Lys Ser Cys Glu Phe Cys 
370 375 380 

Ser Phe Glu Leu Ser Ser Gin Ser Leu Ser Asp Gin Thr Gin Lys Phe 
385 390 395 400 

Pro Asn Asn He Ser Ala Lys Ser Phe Phe Trp Phe Thr Met Phe Pro 
405 410 415 

Thr Leu He Tyr Gin He Glu Tyr Pro Arg Thr Lys Glu He Arg Trp 
420 425 430 

Ser Tyr Val Leu Glu Lys He Cys Ala He Phe Gly Thr He Phe Leu 
435 440 445 

Met Met He Asp Ala Gin He Leu Met Tyr Pro Val Ala Met Arg Ala 
450 455 460 

Leu Ala Val Arg Asn Ser Glu Trp Thr Gly He Leu Asp Arg Leu Leu 
465 470 475 480 

Lys Trp Val Gly Leu Leu Val Asp He Val Pro Gly Phe He Val Met 
485 490 495 

Tyr He Leu Asp Phe Tyr Leu He Trp Asp Ala He Leu Asn Cys Val 
500 505 510 

Ala Glu Leu Thr Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp 
515 520 525 

Asn Cys Val Ser Trp Ala Asp Phe Ser Arg He Trp Asn He Pro Val 
530 535 540 

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Ser Ser Phe 
545 550 555 560 

Lys Leu Asn Lys Ser Gin Ala Thr Leu Met Thr Phe Phe Leu Ser Ser 
565 570 575 

Val Val His Glu Leu Ala Met Tyr Val He Phe Lys Lys Leu Arg Phe 
580 585 590 
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Tyr Leu Phe Phe Phe Gin Met Leu Gin Met Pro Leu Val Ala Leu Thr 
595 600 605 

Asn Thr Lys Phe Met Arg Asn Arg Thr He He Gly Asn Val He Phe 
610 615 620 

Trp Leu Gly He Cys Met Gly Pro Ser Val Met Cys Thr Leu Tyr Leu 
625 630 635 640 

Thr Phe 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AT GGAGCT C A ACTTTCCCCG CTCTCCCCGC ATCCGGAAGC GCTTTCTGCT 

CTTGAGATGC TGTTCTTCAC CCAGCTCCAG GTGGGGCTGA TCCAGCAGTG 

ACCATCCAGA ACT C CAT GAA GCCCTTCAAG GACAT GGACT ACT C AC GCAT 

CTCCTGAAGC TGGCGGTCCC CAATCACCTC AT CTGGCT CA TCTTCTTCTA 

CACTCCTGCC TGAATGCCGT GGCTGAGCTC ATGCAGTTTG GAGAC CGGGA 

GACTGGTGGA ACTCCGAGTC TGTCACCTAC TTCTGGCAGA ACT G GAAC AT 

AAGTGGTGCA TCAGACACTT CTACAAGCCC ATGCTTCGAC GGGGCAGCAG 

GC CAGGAC AG GGGTGTTCCT GGCCTCGGCT TTCTTCCACG AGTACCTGGT 

CTGCGAATGT TCCGCCTCTG GGCTTTCACG GGCATGATGG CTCAGATCCC 

TTCGTGGGCC GCTTTTTCCA GGGCAACTAT GGCAAC GCAG CTGTGTGGCT 

ATCGGACAGC CAATAGC CGT C CT CAT GT AC GTCCAC GAAC TACTACGTGC 

GGCCCCAGCG GCAGAGGCCT GAGCTGCACC TGAGGGCCTG GCTTCTCACT 

ACCCGCTGCC AGAGCCCACC TCTCCTCCTA GGCCTCGAGT GCTGGGGATG 

CACAGCATCC TCCTCTGGTC CCAGGGAGGC CTCTCTGCCC TAT GGGGCTC 

CCCTCAGGGA TGGCGACAGC AGGCCAGACA CAGTCTGATG CCAGCTGGGA 

CCCTGCCCCG GGTCCGAGGG TGTCAATAAA GTGCTGTCCA GT G AG AAAAA 

AAAAAAAAAA ATTCTGCGGC CGC 
(2) INFORMATION FOR SEQ ID NO: 8: 
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GCGACGGATC 60 

GATGGTCCCC 120 

CATCGAGCGC 180 

CTGGCTCTTC 24 0 

GTTCTACCGG 300 

CCCTGTGCAC 360 

CAAGT GGATG 420 

GAGCGTCCCT 4 80 

ACTGGCCTGG 54 0 

GTCGCTCATC 600 

T CAACT AT GA 660 

GCCACCTCAA 7 20 

GGCCTGGCTG 7 80 

TGTCCTGCAC 840 

GTCTTGCTGA 900 

GAAAAAAAAA 960 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Glu Leu Asn Phe Pro Arg Ser Pro Arg lie Arg Lys Arg Phe Leu 
15 10 15 

Leu Arg Arg lie Leu Glu Met Leu Phe Phe Thr Gin Leu Gin Val Gly 
20 25 30 

Leu lie Gin Gin Trp Met Val Pro Thr lie Gin Asn Ser Met Lys Pro 
35 40 45 

Phe Lys Asp Met Asp Tyr Ser Arg lie lie Glu Arg Leu Leu Lys Leu 
50 55 60 

Ala Val Pro Asn His Leu lie Trp Leu lie Phe Phe Tyr Trp Leu Phe 
65 70 75 80 

His Ser Cys Leu Asn Ala Val Ala Glu Leu Met Gin Phe Gly Asp Arg 
85 90 95 

Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp 
100 105 110 

Gin Asn Trp Asn lie Pro Val His Lys Trp Cys lie Arg His Phe Tyr 
115 120 125 

Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Thr Gly 
130 135 140 

Val Phe Leu Ala Ser Ala Phe Phe His Glu Tyr Leu Val Ser Val Pro 
145 150 155 160 

Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Gly Met Met Ala Gin He 
165 170 175 

Pro Leu Ala Trp Phe Val Gly Arg Phe Phe Gin Gly Asn Tyr Gly Asn 
180 185 190 

/Via Ala Val Trp Leu Ser Leu He He Gly Gin Pro He Ala Val Leu 
195 200 205 

Met Tyr Val His Glu Leu Leu Arg Ala Gin Leu 
210 215 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) . TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ATGTTGAACT T CAT GAT GC A TGACCAGCGC ACCGGCCCGG CATGGAACGT GCTGATGTGG 60 

AC CAT GCT GT TTCTAGGCCA GGGAAT C C AG GTCAGCCTGT ACTGCCAGGA GTGGTACGCA 120 

CGGACGCACT GfcCCCTTACC CCAGGCAACT TTCTGGGGGC TGGTGACACC TCGATCTTGG 180 

T C CT GC CAT A CCTAGAGGTC GGGACAGACG ACGCTACCTG CCCAGACACC ACCAAGTTCT 240 

CTGCCTGCAA AACCTGGGGA CCAGGACTTC CTGTCTTGCA TTCCCAAATT TGGGTTCTTG 300 

AGTCGAGGCA ACCTTGCACA CAAGACCCCA CCAAGGGATT GTTGCAAGGG ATTAGATTTT 360 

GCAGATTTGT TGGGTAATGA TTCAACGACT CAGCTGGGGG TTGACCAGGG TTGATTTTTC 420 

AATCCTTTTC CCCTGGGTTT GGGTTACAGG TTTTT 4 55 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Leu Asn Phe Met Met His Asp Gin Arg Thr Gly Pro Ala Trp Asn 
15 10 15 

Val Leu Met Trp Thr Met Leu Phe Leu Gly Gin Gly lie Gin Val Ser 
20 25 30 

Leu Tyr Cys Gin Glu Trp Tyr Ala Arg Thr His Cys Pro Leu Pro Gin 
35 40 45 

Ala Thr Phe Trp Gly Leu Val Thr Pro Arg Ser Trp Ser Cys His Thr 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AT GGACAACG CGGGGTCTGA TACGACTCAC TATAGGGAAT TTGGCCCTCG AGCAGTAGAT 60 

TCGGCACGAT GGGCACGAGG ACTCCATCAT GTTCCTCAAG CTTTATTCCT ACCGGGATGT 120 

CAACCTGTGG TGCCGCCAGC GAAGGGTCAA GGCCAAAGCT GTCTCTACAG GGAAGAAGGT 180 

CAGTGGGGCT GCTGCGAGCA AGCTGTGAGC TATCCAGACA ACCTGACCTA CCGAGATCTC 240 

GATTACTTCA TCTTTGCTCC TACTTTGTGT TAT GAACTC A ACTTTCCTCG GTCCCCCCGA 300 

ATAC GAGAGC GCTTTCTGCT AC GAC GAGTT CTTGAGATGC TCTTTTTTAC CCAGCTTCAA 360 

GTGGGGCTGA TCCAACAGTG GATGGTCCCT ACTATCCAGA ACTCCATGGA AGCCCTTTCA 420 

AGAGCTTCTG GCAGTTTTGG AGACCGCGAG TTCTACAGAG ATTGGTGGAA TGCTGAGTCT 4 80 

GTCAC C GACT TTTGGCAGAA CT GGAAT AT C CCCGTGG 517 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 : 

Met Asp Asn Ala Gly Ser Asp Thr Thr His Tyr Arg Glu Phe Gly Pro 
15 10 15 

Arg Ala Val Asp Ser Ala Arg Trp Ala Arg Gly Leu His His Val Pro 
20 25 30 

Gin Ala Leu Phe Leu Pro Gly Cys Gin Pro Val Val Pro Pro Ala Lys 
35 40 45 

Gly Gin Gly Gin Ser Cys Leu Tyr Arg Glu Glu Gly Gin Trp Gly Cys 
50 55 60 

Cys Glu Gin Ala Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu 
65 70 75 80 

Asp Tyr Phe lie Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro 
85 90 95 

Arg Ser Pro Arg lie Arg Glu Arg Phe Leu Leu Arg Arg Val Leu Glu 
100 105 110 

Met Leu Phe Phe Thr Gin Leu Gin Val Gly Leu lie Gin Gin Trp Met 
115 120 125 

Val Pro Thr lie Gin Asn Ser Met Glu Ala Leu Ser Arg Ala Ser Gly 
130 135 140 

Ser Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser 
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145 150 155 160 

Val Thr Asp Phe Trp Gin Asn Trp Asn lie Pro Val 
165 170 

(2) INFORMATION FOR SEQ ID NO: 13: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 366 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Amino Acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Lys Asp Leu Leu Glu Phe Leu Lys lie Arg Leu Asn Ala Asp Ala 
15 10 15 

Lys Arg Ser Thr Asp Ser Pro Thr Val Ser Glu Val Glu Arg Gly Lys 
20 25 30 

Gin Glu lie Glu Ala His Lys Ser Lys Lys Arg Phe Arg Ser Phe Ser 
35 40 45 

Phe Phe Asp Ser Phe Glu Arg Pro Ser Leu Leu Asp Gly Asn Pro Phe 
50 55 * 60 

Thr Thr Phe Gly Pro Val Leu Glu Lys Glu Lys Asn Leu His Lys Lys 
65 70 75 80 

Lys Thr Thr Val Thr Asp Val Ser Asn Phe Ser Gly lie Tyr Val Phe 
85 90 95 

Trp Met Leu Ala Leu Asp Tyr Tyr Gly Glu lie Leu Tyr Met Thr Thr 
100 105 110 

Leu Phe Thr Val Ala Asp Leu Met Phe Leu Ser Thr Phe Phe Val Val 
115 120 125 

Leu Lys Trp Thr Gly lie Ser lie Glu Phe Leu Phe lie Phe Leu Trp 
130 135 140 

Ser Arg lie Phe Leu Phe Leu His Ser Val Phe Val Met Lys His Ser 
145 150 155 160 

Phe Ala Phe Tyr Asn Gly Tyr Leu Trp lie Lys Glu Glu Leu Ser Leu 
165 170 175 

Lys Tyr Lys Glu Ser Ser Pro Leu Gin Lys Ser Cys Phe Cys Phe Glu 
180 185 190 

Leu Gin Phe Pro Asn Asn lie Ser Phe Phe Phe Phe Pro Thr Leu lie 
195 200 205 

Tyr Gin lie Tyr Pro Arg Thr He Arg Trp Tyr Val Leu Glu Lys Cys 
210 215 220 

Ala He Phe Gly Thr He Phe Leu Met. Met Ala Gin Met Pro Val Ala 
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225 230 235 240 

Met Arg Asn Phe Trp Gin Leu Leu Asp lie Pro Gly Phe Val Leu Tyr 
245 250 255 

Leu Thr Phe Tyr lie Trp Asp Ala Leu Asn Cys Val Ala Glu Leu Thr 
260 265 270 

Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp Asn Cys Val Ser 
275 280 285 

Phe Ser Arg lie Trp Asn Val Pro Val His Lys Phe Leu Leu Arg His 
290 295 300 

Val Tyr His Ser Ser Met Phe Lys Leu Lys Ser Gin Ala Thr Leu Thr 
305 310 315 320 

Phe Phe Leu Ser Ala Val Val His Glu Ala Met Val lie Phe Arg Tyr 
325 330 335 

Leu Phe Phe Gin Gin Met Ala Leu Asn Thr Lys Phe Arg Arg lie Asn 
340 345 350 

Val Phe Trp Gly Cys Gly Pro Ser Val Thr Leu Tyr Leu Thr 
355 360 365 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Amino Acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Pro Asn His Leu He Trp Leu He Phe Phe Tyr Trp Leu Phe His Ser 
15 10 15 

Cys Leu Asn Ala Val Ala Glu Leu Met Gin Phe Gly Asp Arg Glu Phe 
20 25 30 

Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp Gin Asn 
35 40 45 

Trp Lys He Pro Val His Lys Trp Cys He Arg His Phe Tyr Lys Pro 
50 55 60 

Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Asp Arg Gly Val 

65 70 75 80 

Pro Gly Pro Ser Ala Phe Phe His Val Val Thr Trp Val Ser Val Pro 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GAGGGGACGA AAATTAGC CG CTATTAATTC TGGTATTGCC AC CTAGACAA GAAGTAAACA 
GACACAGATG CAAGAGTTCG AATCTCTTAG C 
(2) INFORMATION FOR SEQ ID NO : 1 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
C T AT AAAGAT TTAATAGCTC CACAGAACAG TTGCAGGATG CCTTAGGGTC GACTACGTCG 
TAAGGCCGTT TCTGAC 

(2) INFORMATION FOR SEQ ID NO; 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CATTGCAGTT ACACGTGAAT GC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TAG C T C C AC A GAACAGTTGC AGG 23 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CTCTGACAAC AACGAAGTCA G 21 
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What is claimed is: 

1 . An isolated nucleic acid which encodes an 
acylcoenzyme A: cholesterol acyltransf erase II. 

5 

2. An isolated nucleic acid which encodes an 
acylcoenzyme A: cholesterol acyltransf erase III. 

3. The isolated nucleic acid of claim 1 or 2, wherein 
10 the nucleic acid is DNA or RNA. 

4. The isolated nucleic acid of claim 3, wherein the 
nucleic acid is cDNA or genomic DNA. 

15 5. The isolated nucleic acid of claim 1 comprising a 
nucleic acid having the sequence as set forth in 
Figure 15. 

6. The isolated nucleic acid of claim 1, wherein the 
20 nucleic acid encodes a human wildtype acylcoenzyme 

A: cholesterol acyltransf erase II having 
substantially the same amino acid sequence as set 
forth in Figure 15. 

25 7. The isolated nucleic acid of claim 2, comprising a 
nucleic acid having the sequence as set forth in 
Figure 16. 

8. The isolated nucleic acid of claim 2, wherein the 
3 0 nucleic acid encodes a human wildtype acylcoenzyme 

A: cholesterol acyltransf erase III having 
substantially the same amino acid sequence as set 
forth in Figure 16. 

35 9. The isolated nucleic acid of claim 1 comprising a 
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nucleic acid having the sequence designated Seq. 
I.D. No.: 11. 

10. The isolated nucleic acid of claim 1, wherein the 
5 nucleic acid encodes a mouse wildtype acylcoenzyme 

A: cholesterol acyltransf erase II having 
substantially the same amino acid sequence as the 
sequence designated Seq. I.D. No.: 12. 

10 11. The isolated nucleic acid of claim 1, wherein the 
nucleic acid encodes a mutant acylcoenzyme A: 
cholesterol acyltransf erase II. 

12. The isolated nucleic acid of claim 2, wherein the 
15 nucleic acid encodes a mutant acylcoenzyme A: 

cholesterol acyltransf erase III. 

13. A vector comprising the isolated nucleic acid of 
claim 1 or 2 . 



0 



14. The vector of claim 13 further comprising a promoter 
of RNA transcription operatively linked to the 
nucleic acid. 

15. The vector of claim 14, wherein the promoter 
comprises a bacterial, yeast, insect or mammalian 
promoter . 

16. The vector of claim 14, further comprising plasmid, 
cosmid, yeast artificial chromosome (YAC) , 
bacteriophage or eukaryotic viral DNA. 

17. The vector of claim 14 designated YEpAB-ACAT2 . 



5 18. 



A host vector system for the production of a 
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polypeptide which comprises the vector of claim 14 
in a suitable host, 

19. The host vector system of claim 18, wherein the 
suitable host is a prokaryotic or eukaryotic cell. 

20. The host vector system of claim 19, wherein the 
prokaryotic cell is a bacterial cell. 

21. The host vector system of claim 19, wherein the 
eukaryotic cell is a yeast, insect, plant or 
mammalian cell. 



22. A method for producing a polypeptide which comprises 
15 growing the host vector system of claim 18 under 

suitable conditions permitting production of the 
polypeptide and recovering the polypeptide so 
produced. 

20 23. A method of obtaining a polypeptide in purified form 
which comprises: 

(a) introducing the vector of claim 14 into a 
suitable host cell; 

(b) culturing the resulting cell so as to produce 
25 the polypeptide; 

© recovering the polypeptide produced in step 
(b) ; and 

(d) purifying the polypeptide so recovered. 

30 24. A purified wildtype acylcoenzyme A: cholesterol 
acyltransf erase II. 

25. A purified mutant acylcoenzyme A: cholesterol 
acyltransf erase II . 

35 
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26. A purified wildtype acylcoenzyme A: cholesterol 
acyltransf erase III . 

27. A purified mutant acylcoenzyme A: cholesterol 
5 acyltransf erase III. 

28. An oligonucleotide of at least 15 nucleotides 
capable of specifically hybridizing with a unique 
sequence of nucleotides present within a nucleic 

10 acid which encodes a wildtype acylcoenzyme A: 

cholesterol acyltransf erase II without hybridizing 
to a nucleic acid which encodes a mutant 
: acylcoenzyme A: cholesterol acyltransf erase II. 

15 29. An oligonucleotide of at least 15 nucleotides 
capable of specifically hybridizing with a unique 
sequence of nucleotides present within the nucleic 
acid which encodes a mutant acylcoenzyme A: 
cholesterol acyltransf erase II without hybridizing 

20 to a nucleic acid which encodes a wildtype 

acylcoenzyme A: cholesterol acyltransf erase II. 

30. An oligonucleotide of at least 15 nucleotides 
capable of specifically hybridizing with a unique 

25 sequence of nucleotides present within a nucleic 

acid which encodes a wildtype acylcoenzyme A: 
cholesterol acyltransf erase III without hybridizing 
to a nucleic acid which encodes a mutant 
acylcoenzyme A: cholesterol acyltransf erase III. 

30 

31. An oligonucleotide of at least 15 nucleotides 
capable of specifically hybridizing with a unique 
sequence of nucleotides present within the nucleic 
acid which encodes a mutant acylcoenzyme A: 

35 cholesterol acyltransf erase III without hybridizing 
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to a nucleic acid which encodes a wildtype 
acylcoenzyme A: cholesterol acyltransf erase III. 

32. The oligonucleotide of claim 28, 29, 30 or 31 
wherein the nucleic acid is DNA or RNA. 

33. A nucleic acid having a sequence complementary to 
the sequence of the isolated nucleic acid of claim 
1 or 2. 



34. A method for determining whether a subject known to 
have an imbalance in sterol levels has the imbalance 
due to a defect in esterif ication of sterol which 
comprises : 

15 (a) obtaining from the subject an appropriate 

sample containing a mixture of all of the 
subject's nucleic acids; and 
(b) determining whether any nucleic acid in the 
sample from step (a) is, or is derived from, a 

20 nucleic acid which encodes a mutant 

acylcoenzyme A: cholesterol acyltransf erase so 
as to thereby determine whether the subject's 
imbalance in sterol levels is due to a defect 
in esterif ication of sterol. 

25 

35. The method of claim 34, wherein the determining of 
step (b) comprises: 

(I) contacting the sample of step (a) with the 

isolated nucleic acid of claim 11 or 12 or 

30 the oligonucleotide of claim 29 or 31 

under conditions permitting binding of any 
nucleic acid in the sample which is, or is 
derived from, a nucleic acid which encodes 
a mutant acylcoenzyme A: cholesterol 

35 acyltransf erase to the nucleic acid or 
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(ii) 



oligonucleotide so as to form a complex; 
isolating the complex so formed; and 



(iii) 



identifying the nucleic acid in the 
isolated complex so as to thereby 
determine whether any nucleic acid in the 
sample contains a nucleic acid which is, 
or is derived from, a nucleic acid which 



encodes a mutant acylcoenzyme 
cholesterol acyltransf erase II or III. 



A: 



10 



15 



20 



25 



30 



36. The method of claim 35, wherein the. isolated nucleic 
acid or the oligonucleotide is labeled with a 
detectable marker. 

37. The method of claim 36, wherein the detectable 
marker is a radioactive isotope, a fluorophore or an 
enzyme. 

38. The method of claim 35, wherein the nucleic acid 
sample is first bound to a solid matrix before 
performing step (I). 

39. The method of claim 35, wherein the sample comprises 
blood or sera. 

40. A method for treating a subject who has an imbalance 
in sterol levels due to a defect in esterif ication 
of sterol which comprises introducing the isolated 
nucleic acid of claim 1 or 2 into the subject under 
conditions such that the nucleic acid expresses a 
wildtype acylcoenzyme A: cholesterol acyltransf erase 
II or III, so as to thereby treat the subject. 

41. A method for inhibiting wildtype acylcoenzyme A: 
cholesterol acyltransf erase II or III in a subject 



BNSDOCID: <WO 9745439 A 1 __!_> 



WO 97/45439 




PCT/US97/09460 



-82- 

which comprises transforming appropriate cells from 
the subject with a vector which expresses the 
nucleic acid of claim 33, and introducing the 
transformed cells into the subject so as to thereby 
inhibit wildtype acylcoenzyme A: cholesterol 
acyltransf erase II or III. 



42. The method of claim 41, wherein the nucleic acid of 
claim 33 is capable of specifically hybridizing to 
!0 a mRNA molecule encoding acylcoenzyme A: cholesterol 

acyltransf erase II or III so as to prevent 
translation of the mRNA molecule. 



43. A method for inhibiting the wildtype acylcoenzyme A: 
15 cholesterol acyltransf erase II or III in a subject 

which comprises introducing the oligonucleotide of 
claim 28 or 30 into the subject so as to thereby 
inhibit the wildtype acylcoenzyme A: cholesterol 
acyltransf erase II or III. 

20 

44. The method of claim 43, wherein the oligonucleotide 
of clam 28 or 30 is capable of specifically 
hybridizing to a mRNA molecule encoding acylcoenzyme 
A: cholesterol acyltransf erase II or III so as to 

25 prevent translation of the mRNA molecule. 

45. A method for identifying a chemical compound which 
is capable of inhibiting acylcoenzyme A: cholesterol 
acyltransferase II or III in a subject which 

30 comprises: 

(a) contacting a wildtype acylcoenzyme A: 
cholesterol acyltransferase II or III with the 
chemical compound under conditions permitting 
binding between the acylcoenzyme and the 

3 5 chemical compound; 
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(b) detecting specific binding of the chemical 

compound to the acylcoenzyme; and 
<£> determining whether the chemical compound 
inhibits the activity of the coenzyme so as to 
5 identify a chemical compound which is capable 

of inhibiting acylcoenzyme A: cholesterol 
acyltransf erase II ir III in a subject. 

46. A pharmaceutical composition comprising the chemical 
10 compound identified by the method of claim 45 in an 

amount effective to inhibit acylcoenzyme A: 
cholesterol acyltransf erase II or III in a subject 
and a pharmaceutical^ effective carrier. 

15 47. A method of treating a subject who has 

atherosclerosis comprising administering the 

pharmaceutical composition of claim 4 6 to the 
subject . 

20 48. A method of treating a subject who has 

hyperlipidemia comprising administering the 

pharmaceutical composition of claim 4 6 to the 
subj ect . 

25 49. A transgenic, nonhuman mammal comprising the 
isolated nucleic acid of claim 1 or 2 . 
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